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1 THE VIDEOGRAPHER: Good morning, we're on 

2 the video record. Today's date is September 10, 

3 1997. The time is now 8:38 a.m. 

4 My name is Dave Jenkins, the video 

5 technician, associated with Ray J. Lerschen's & 

6 Associates. Today's witness is Scott Zeger. May we 

7 have the introduction of counsel followed in by the 

8 swearing of the witness. 


Expert Report of Drs. Zeger, Wyant 19 

and Miller (No Bates Numbers) 

Attachment B, Major Smoking 19 

Attributable Diseases 
(No Bates Numbers) 

ICD-9 Description (No Bates Numbers) 227 



9 

MR. SILFEN: I'm Tom Silfen, counsel for 

08:38:26 

10 

Philip Morris from Arnold & Porter. 

08:38:30 

11 

MR. BIERSTEKER: Peter Biersteker from 

08:38:34 

12 

Jones Day, counsel for R.J. Reynolds Tobacco 

08:38:36 

13 

Company. 

08:38:36 

14 

MS. STEURY: Ellen Steury from Arnold & 

08:38:38 

15 

Porter, counsel for Philip Morris. 

08:38:40 

16 

MR. SCHWARTZBAUER: Bob Schwartzbauer from 

08:38:44 

17 

Dorsey & Whitney in Minneapolis, counsel for Philip 

08:38:48 

18 

Morris Inc. 

08:38:48 

19 

MR. HAMLIN: Thomas Hamlin, Robins, 

08:38:50 

20 

Kaplan, Miller & Ciresi, counsel for Plaintiffs 

08:38:52 

21 

State of Minnesota and Blue Cross/Blue Shield of 

08:38:54 

22 

Minnesota. 

08:39:08 

23 



24 



25 




1 SCOTT L. ZEGER, Ph.D. 

2 the Witness in the above-entitled 

3 matter after having been previously 

4 duly sworn testifies and says as follows: 

5 

6 EXAMINATION 

7 BY MR. SILFEN: 

: 39:14 
: 3 9 :16 
: 39:22 
: 39:24 


8 Q. Good morning. Dr. Zeger. I am Tom Silfen, and I am 08 

9 counsel for Philip Morris, and this is the 08 

10 deposition of Dr. Scott Zeger in the Minnesota AG 08 

11 case, at least that's what I call it. 08 
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12 

Now, Dr. Zeger, I'm going to start by 

08:39:28 

13 

marking as an exhibit to the deposition the 

report 

08:39:34 

14 

that I believe you authored, along with Dr. 

Wyant 

08:39:40 

15 

and Dr. Miller. 


08:39:40 

16 

We have a very exotic numbering system 

08:39:46 

17 

here, so I'm not going to try to guess what 

the 

08:39:48 

18 

number is. Does anyone know? 


08:39:50 

19 

Why don't we proceed without it. 

I'm 

08:40:04 

20 

going to mark, and when the time comes, as 

the first 

08:40:08 

21 

exhibit this report which I will hand to the witness 

08:40:10 

22 

and to Mr. Hamlin. 


08:40:16 

23 

Now, what I've given you is the narrative 

08:40:18 

24 

portion of your report and the footnotes. 

The whole 

08:40:22 

25 

report and attachments have been designated 

and 

08:40:26 


7 


1 


identified in an earlier deposition. 


08:40:28 

2 


I didn't want to pile it up here, but 

if 

08:40:30 

3 


you want attachments at any time, they are here 

and 

08:40:32 

4 


we probably will talk about them later. Okay? 


08:40:38 

5 


It's better to answer audibly, even when 

08:40:42 

6 


you would normally just shake your head. 


08:40:44 

7 

A. 

Okay. 


08:40:44 

8 

Q. 

Is this, at least on its face, your report? 


08:40:54 

9 

A. 

Based upon this cursory examination, it appears 

to 

08:40:58 

10 


be my report. 


08:40:58 

11 

Q. 

If at any time it appears that it's something other 

08:41:00 

12 


than your complete, narrative statement, then you 

08:41:04 

13 


will let us know. 


08:41:06 
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14 


Now, we've previously had time to talk to 

08:41:12 

15 


Dr. Wyant and Dr. Leonard Miller. 

08:41:18 

16 


Have you had a chance to read their 

08:41:20 

17 


depositions? 

08:41:22 

18 

A. 

Yes. 

08:41:22 

19 

Q. 

Okay. Have you read any other depositions taken in 

08:41:30 

20 


these litigations in other cases? 

08:41:32 

21 

A. 

Could you repeat that? I'm sorry, in other cases? 

08:41:40 

22 

Q. 

Yes. 

08:41:40 

23 

A. 

No. 

08:41:42 

24 

Q. 

Have you read any depositions in this case other 

08:41:44 

25 


than Doctors Wyant and Leonard Miller? 

08:41:48 


1 A. 

2 Q. 

3 

4 

5 

6 

7 

8 A. 

9 Q. 

10 A. 

11 
12 

13 

14 

15 

16 Q. 


No. 

As you know then from reading the depositions. 
Doctors Miller and Wyant indicated that there was 
some division of responsibility for the report, 
although they said it was a group project. 

What did you consider to be your areas of 
principal responsibility? 

This was a collaborative effort. 

They said so and I'm not doubting that. 

I was involved in all aspects. I didn't do any of 
the computing calculations, myself, but was involved 
in planning and designing the core model and in 
reviewing and finalizing the refined model, and in 
design and review of the work in the nursing home 
model. 

Okay. And when you say refined model, you're 


08:41:48 
08:41:54 
08:42:00 
08:42:04 
08:42:06 
08:42:08 
08:42:12 
08:42:14 
08:42:16 
08:42:24 
08:42:28 
08:42:34 
08:42:44 
08:42:48 
08:42:50 
08:42:52 


http://legacy.library.ucsf.@3iJUijft^i^c|(i6§a0QfipBif.industrydocuments.ucsf.edu/docs/rqfl0001 



17 


including what I think of as the diminished health 

08:42:56 

18 


status model, I take it? 



19 

A. 

Yes. 


08:42:58 

20 

Q. 

Maybe we should get our terms straight since I 

think 

08:43:04 

21 


of the core model in the same terms, I believe. 

that 

08:43:08 

22 


you do. 


08:43:10 

23 


If I say refined disease model, will 

we 

08:43:12 

24 


then be talking about the same thing, which is 

the 

08:43:16 

25 


refined model with respect to lung cancer/COPD 

and 

08:43:20 


9 


1 


CHD/stroke? 

08:43:22 

2 

A. 

Okay. 

08:43:22 

3 

Q. 

And then I'll say the diminished health status model 

08:43:26 

4 


when I mean the diminished health status model. 

08:43:30 

5 

A. 

Okay. 

08:43:30 

6 

Q. 

And I will say the nursing home model when I mean 

08:43:32 

7 


the nursing home model. 

08:43:34 

8 

A. 

Okay. 

08:43:34 

9 

Q. 

Good. Do I take it then that I can generally expect 

08:43:52 

10 


that you will be able to answer questions regarding 

08:43:56 

11 


all aspects of the report? 

08:43:58 

12 

A. 

I'll do my best. 

08:44: 04 

13 

Q. 

I understand. 

08:44: 04 

14 

A. 

I would say that as I did not actually sit at the 

08:44:10 

15 


computer and do calculations either in the core 

08:44:12 

16 


model or in the refined model in its aggregate. 

08:44:18 

17 


there may be questions about the very specific 

08:44:20 

18 


details which I would have to refer you elsewhere 

08:44:28 
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19 

on, but I'll do my best 

to answer 

all 

the 

08:44:28 

20 

questions. 




08:44:28 

21 Q. 

I am probably going to 

go through 

the 

details as I 

08:44:32 

22 

did with Dr. Miller. And if there 

are 

! places 

08:44:36 

23 

where — obviously, if 

there are places where 

08:44:40 

24 

Dr. Miller knows, just 

say so, and 

we 

will shortcut 

08:44:44 

25 

the discussion. 




08:44:44 


1 


10 

With that in mind, did you encounter any 

08:44:50 

2 


places where statements by Dr. Miller were not 

08:44:58 

3 


completely correct to your view or did you approve 

08:45:06 

4 


of everything he said? 

08:45:06 

5 

A. 

Said where? 

08:45:10 

6 

Q. 

In his deposition. 

08:45:12 

7 

A. 

I read his deposition through rather quickly. It 

08:45:20 

8 


was a lot of reading. And I didn't read it to sort 

08:45:22 

9 


of think through every answer and decide whether I 

08:45:24 

10 


agreed or disagreed. So, you know, I can't 

08:45:28 

11 


really — if you wanted to ask me a specific 

08:45:32 

12 


example, I'd be happy to — 

08:45:32 

13 

Q. 

That's fair. 

08:45:34 

14 

A. 

— think it through. 

08:45:34 

15 

Q. 

I was really wondering if you came armed with 

08:45:36 

16 


anything in particular that — 

08:45:38 

17 

A. 

No. 

08:45:38 

18 

Q. 

The same with Dr. Wyant, is there anything in 

08:45:42 

19 


particular that he said that you know right now that 

08:45 : 44 

20 


you have a different view on? 

08:45:46 

21 

A. 

No. 

08:45:46 
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22 Q. 

23 

24 

25 


Okay. I also read Dr. Miller last night and had the 
same experience. It's not easy reading. I say that 
having asked a lot of the questions and I still 
found it hard going. 


08:45:52 

08:45:58 

08:46:00 

08:46:00 


1 

2 

3 

4 

5 

6 

7 A. 

8 Q. 

9 A. 
10 

11 

12 

13 

14 Q. 

15 

16 

17 

18 

19 

20 

21 A. 

22 

23 Q. 


11 


I think that Dr. Miller said something to 
the effect that you were principally involved in 
defining the product. I wrote that down, those were 
his words. 

What's your reaction to that? Were you 
principally involved in defining the product? 

I'm not sure I know what that — 

I'm not sure I know, either. 

I would just go back to what I had said earlier as 
to my role. I think that my previous answer 
describes accurately what my role was. I was part 
of a collaborative team that designed and 
implemented these analyses. 

Okay. I think Dr. Miller also said that you were 
principally responsible for organizing it and 
essentially developing a way to communicate it 
easily. 

I took that to mean that you had had a 
significant role in shaping the presentation. By 
that I mean report; is that correct? 

I had a significant role as did I think Doctors 
Miller and Wyant. 

Did you draft it? 


08:46:10 
08:46:14 
08:46:18 
08:46:20 
08:46:24 
08:46:26 
08:46:28 
08:46:30 
08:46:34 
08:46:36 
08:46:40 
08:46:42 
08:46:44 
08:46:48 
08:46:52 
08:46:56 
08:46:56 
08:47:00 
08:47:06 
08:47:08 
08:47:14 
08:47:20 
08:47:20 
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24 A. 


I drafted sections of it. 


08:47:22 


25 Q. Which sections did you draft? 


08:47:24 


Let's see, I think I drafted the section on — I'm 08:47:30 

not sure if I drafted — I can't remember if I 08:47:34 

drafted the text as it appears now. 08:47:36 

I certainly sketched large sections of the 08:47:40 
core model, the ideas and the idea of a series of 08:47:46 

probabilities applied to the total, fractions 08:47:50 

applied to the total expenditures. 08:47:54 

I think I drafted the section on goodness 08:47:58 
of fit. And there may be some other sections which 08:48:02 


10 


don't come back to me immediately. 

08:48:04 

11 


But I would say that each of us wrote part 

08:48:08 

12 


of it and certainly each of us carefully read and 

08:48:12 

13 


revised and worked together to get a final 

08:48:14 

14 


document. 

08:48:14 

15 

Q. 

All right. Do you know Vincent Miller? 

08:48:48 

16 

A. 

No. 

08:48:48 

17 

Q. 

Do you know who he is? 

08:48:50 

18 

A. 

I saw reference to Vincent Miller in one of the 

08:48:58 

19 


depositions. I don't know who he is other than the 

08:49:00 

20 


mention of the name. 

08:49:02 

21 

Q. 

How about Jeffery Harris? 

08:49:02 

22 

A. 

I know of an economist by the name of Jeffery 

08:49:06 

23 


Harris, who I believe is at MIT. I don't know if 

08:49:12 

24 


that's the Jeffery Harris to whom you're referring. 

08:49:16 

25 


(Mr. Scott entered the deposition room.) 
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1 


MR. 

HAMLIN: Do you know Mr. Scott? 

08:49:20 

2 


MR. 

SILFEN: I don't. 


3 


MR. 

HAMLIN: Basic introductions here. 

08:49:22 

4 


This is Bob Scott, he's one of the counsel for the 

08:49:24 

5 


State of Wisconsin. He's also been admitted pro hac 

08:49:28 

6 


vice here. This is Tom Silfen, Peter Biersteker — 

08:49:34 

7 


MS . 

STEURY: Ellen Steury. 

08:49:34 

8 


MR. 

HAMLIN: I'm sorry, Ellen. 


9 


MR. 

SCOTT: I didn't mean to be a force of 

08:49:38 

10 


test. 


08:49:38 

11 


MR. 

THOMPSON: Tom Thompson. 

08:49:40 

12 


MR. 

SCHWARTZBAUER: Bob Schwartzbauer. 

08:49:42 

13 


MR. 

HAMLIN: This is Professor Scott 

08:49:44 

14 


Zeger. 


08:49:44 

15 


MR. 

SCOTT: Thank you. Sorry for the 

08:49:48 

16 


interruption. 


08:49:48 

17 


MR. 

SILFEN: Why don't we talk off the 

08:49:52 

18 


record and tell me why your northern neighbor is 

08:49:54 

19 


visiting. 


08:49:54 

20 


(Discussion off the written record.) 


21 

BY 

MR. SILFEN: 



22 

Q. 

I take it you' 

re not familiar with work that 

08:50:14 

23 


Dr. Harris has 

done in these cases? 

08:50:18 

24 

A. 

That's correct 

, I'm not familiar with his work. 

08:50:20 

25 

Q. 

Are you aware 

that in this case the defense has 

08:50:28 


14 
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1 


2 A. 

3 Q. 

4 A. 

5 Q. 

6 

7 

8 A. 

9 

10 

11 

12 

13 

14 

15 

16 

17 Q. 

18 

19 

20 A. 

21 
22 

23 

24 

25 


filed expert reports that relate to your report? 

I am aware that expert reports have been filed. 

Have you read those reports? 

I have skimmed through some of them. 

Now, is it your understanding that your group is in 
some respects reworking its report in reaction to 
some of the comments in the defense reports? 

We're doing some additional calculations, most of 
which represents a completion of work which was 
undertaken and most of which is reported here in 
this document. 

I believe there may be one aspect of the 
calculations that are being done partly at I think 
my suggestion and the suggestion was also put 
forward, I think, in one of those reports. 

Although, I don't remember which one. 

Did you have a meeting with Doctors Wyant and Miller 
to discuss what additional work you would do or not 
do in reaction to those reports? 

I recall a meeting where we discussed some aspects 
of — some comments made in those reports. I would 
say the majority of the work that we were attempting 
to complete now is independent of those comments and 
is just trying to finish what was part of our 
original plan. 


08 
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08 
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08 
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: 00 

08 

: 52 : 

: 00 

08 

: 52 : 

: 10 

08 

: 52 : 

: 14 

08 

: 52 : 

: 16 

08 

: 52 : 

: 22 

08 

: 52 : 

: 24 

08 

: 52 : 

: 2 8 


15 


1 Q. Well, if you read, and I know that you did read 08:52:30 

2 Dr. Miller, you know that he, in fact, mentioned 08:52:32 
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that some work was being done in response to those 


08:52:34 


reports? 


08:52:34 


Yes, I can think of — I mean, I don't remember his 08:52:38 
specific comment, but I won't argue that he did say 08:52:42 


that. I don't recall if he said that or not. 


08:52:44 


Okay. When you had this meeting or conversation in 08:52:48 


which the reports were discussed, did you and 


08:52:56 


Dr. Miller and Dr. Wyant prepare for the meeting by 08:53:04 


reading the reports? 


08:53:06 


12 A. Actually, I did not read the reports in preparation 08:53:08 

13 for that meeting. But there was some discussion of 08:53:12 


the reports there. I had, prior to that meeting. 


08:53:14 


skimmed very quickly, ten minutes, for a stack of 


08:53:18 


documents that big. 


08:53:22 


But there was some discussion, and at that 08:53:24 
report — at that meeting there was some, you know, 08:53:26 


a paragraph read here or there. 


08:53:30 


20 Q. Have you done more than skim them now or are you 


08:53:34 


still in the skim state? 


08:53:34 


22 A. 


That's where I am. 


08:53:36 


23 Q. Well, you're in the same state as Doctors Wyant and 08:53:42 


Miller, you're all in the skim state. 


08:53:44 


Let's take a look at your report. 


08:53:56 


MR. SILFEN: Do we have an exhibit 


08:54:00 


number? 


MR. SCHWARTZBAUER: No, we can't find the 


keeper of the numbers. 
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5 

6 

7 

8 
9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 
21 
22 

23 

24 

25 


BY MR. SILFEN: 

Q. Well, we will proceed without a number. I will 

refer to your report as your report and we will be 
communicating. 

A. So noted. 

Q. On page 1, the first sentence, do you see that? 

A. Yes. 

Q. The first sentence states. I'll quote it, "We were 
retained by the State of Minnesota (the state)," in 
parens, "and Blue Cross Blue Shield of Minnesota, 
(Blue Cross)," in parens, "to determine the amount 
of money they expended in 1978-1996 to purchase 
smoking attributable health care services." That's 
the end of the quote. 

I will from time to time quote material 
that we both have in front of us and both can read. 
That is being done strictly for the record so that 
someone reading the record will know what we're 
talking about. 

A. I understand. 

Q. It is a bother, but it's kind of the procedure. 
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17 


1 Did you write that sentence? 

2 A. I don't recall. 

3 Q. Does this sentence define the task that you 

4 understood you were undertaking? 

5 A. Yes. 

6 Q. And how was that task defined? How did you decide 

7 this was the task? 
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8 

A 

9 

Q 

10 


11 


12 

A 

13 

Q 

14 


15 

A 

16 

Q 

17 


18 

A 

19 


20 


21 


22 


23 


24 

Q 

25 



Could you — I don't understand the question. 

Well, I assume that when you set out on an 
investigation, you first defined the task; isn't 
that correct? 

Yes, it is as stated here. 

Well, and how was it decided that that was the 
task? 

Well, this was what was asked of us by counsel. 

In other words, in these very words it was described 
to you that this was the task? 

No, these are our words, but we were asked to look 
at the expenditures for this period of time that the 
state or Blue Cross had made that were for health 
care services attributable to smoking. 

And we had a specific period of time and 
that was the task we undertook. 

Do you recall any discussions about the definition 
of the task and whether there were different tasks 
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1 that might have been or should have been 

2 undertaken? 

3 A. No. I was not party to a debate about or discussion 

4 about whether this was the right task. This was the 

5 task that we had been asked to undertake, so that's 

6 what we focused on doing. 

7 Q. Well, did there ever come a time when you questioned 

8 whether another task or another definition of a task 

9 would be appropriate? 



10 

A. 

No. 

08:57:58 

11 

Q. 

I'm looking at the top of the second page, and in 

08:58:50 

12 


the first sentence there's a reference to the major 

08:58:56 

13 


smoking attributable diseases that are used in this 

08:59:00 

14 


report. 

08:59:00 

15 

A. 

Sir, did you say the second sentence? 

08:59:02 

16 

Q. 

The first sentence. 

08:59:04 

17 

A. 

Yes. 

08:59:04 

18 

Q. 

A reference to the major smoking attributable 

08:59:06 

19 


diseases that are used in this report. I am told 

08:59:18 

20 


that the exhibit you have in front of you is number 

08:59:22 

21 


2401. 

08:59:26 

22 

A. 

I feel much better. 

08:59:30 

23 

Q. 

I am greatly relieved, myself. 

08:59:32 

24 


Now, I take it that from the other 

08:59:44 

25 


depositions that the list of smoking attributable 

08:59:50 


19 


1 diseases was given to you by Dr. Samet; is that 

2 true? 

3 A. Could I have that list? 

4 Q. Sure, absolutely. Why don't we mark them both. 

5 This is Attachment B. 

6 MR. SILFEN: I'll ask the reporter to mark 

7 the report as 2401 and Attachment B as 2402. 

8 (Defendants' Exhibits 2401 and 2402 marked for 

9 identification by the reporter.) 


10 

BY 

MR. SILFEN: 


11 

Q. 

Okay. 

I take it now we have marked the 

narrative 

12 


report 

as Exhibit 2401 and Exhibit B as 

Exhibit 
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13 


2402 . 


09:01:40 

14 


My question referred to 

Exhibit B, which 

09:01:42 

15 


the narrative describes as a definition of major 

09:01:48 

16 


smoking attributable diseases; is 

that correct? 

09:01:50 

17 

A. 

Attachment B, yes, is the list of 

major smoking 

09:01:54 

18 


attributable diseases. 


09:01:56 

19 

Q. 

And I understood from Dr. Miller 

and Dr. Wyant that 

09:02:00 

20 


that list was given to you by Dr. 

Samet; is that 

09:02:06 

21 


true? 


09:02:06 

22 

A. 

That's my understanding, yes. 


09:02:08 

23 

Q. 

So you had no part in deciding what the major 

09:02:10 

24 


smoking attributable diseases were? 

09:02:12 

25 

A. 

That's correct. 


09:02:12 


20 


1 

Q. 

The next sentence on page 2 states, and I'll quote. 

09:02:26 

2 


"The state and Blue Cross paid for health care 

09:02:30 

3 


services for more than 90,000 persons suffering from 

09:02:34 

4 


these diseases during 1978 to 1996." 

09:02:36 

5 


Do you see that? 

09:02:38 

6 

A. 

Yes. 

09:02:40 

7 

Q. 

Is it your understanding, as Dr. Miller told us. 

09:02:46 

8 


that those are actually the 90,000 persons whose 

09:02:58 

9 


experience is reflected in these reports? 

09:03:02 

10 

A. 

I'm sorry, I don't understand the question. 

09:03:06 

11 

Q. 

Well, these are the real people whose medical 

09:03:10 

12 


experience is the basis for these reports, correct? 

09:03:14 

13 

A. 

These are some of the people. 

09:03:18 

14 

Q. 

Who are the other people? 

09:03:22 
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15 A 

16 

17 

18 

19 Q 

20 

21 A 

22 

23 

24 Q 

25 A 


1 Q 

2 

3 

4 A 

5 Q 

6 

7 

8 A 

9 Q 
10 

11 

12 

13 A 

14 Q 

15 

16 A 

17 Q 


Well, this report reflects expenditures for anybody 
who received health care services in the period of 
time from Medicaid, GAMC, or Blue Cross/Blue Shield, 
of which these 90,000 are a part. 

Who are the other people that are not covered by the 
state and Blue Cross? 

Well, there might be people who are not suffering 
from the specific diseases listed here that are also 
a subject in this report. 

I see. 

These are 90,000 — well — 


09:03:28 
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09:03:42 
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09:03:48 
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09:03:56 

09:03:58 
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09:04:02 


21 


I see. So these would be the people in this report 
suffering from the major smoking attributable 
diseases? 

That's my understanding. 

Okay. That's fair. That's good. And, in fact, had 
I looked at the next sentence, that's the import of 
the next sentence, I take it, in addition? 

Correct. 

Okay. The middle paragraph on page 2 introduces a 
chart indicating percentages of Minnesotans who had 
a history of smoking. 

Do you see that? 

Yes. 

And what is the significance of that chart to this 
report? 

Could you — I don't quite understand the question. 
You put it in the chart, you put it in the report. 
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18 

19 

20 A. 

21 
22 

23 

24 

25 


and I want to know why it's here. What's its 
importance? 

This Table 1 lists the percentages of Minnesotans 
age 19 and over who during the period '84 to '94 
smoked or had a history of smoking. 

And it shows that for persons covered by 
their employer or union, about 25 percent are 
current smokers and more than 50 percent are current 


09:05:12 

09:05:12 

09:05:18 

09:05:22 

09:05:26 

09:05:32 

09:05:36 

09:05:38 


22 


1 


or former smokers, and that the rate is somewhat 

09:05:42 

2 


higher for persons covered by Medicaid. 

09:05:44 

3 


The significance is that a large fraction 

09:05:46 

4 


of citizens in Minnesota or Minnesotans are either 

09:05:52 

5 


current or former smokers. 

09:05:54 

6 

Q. 

And in — this is what we call in this report 

09:05:56 

7 


sometimes ever-smokers, right? 

09:05:58 

8 

A. 

When you say this is, which — 

09:06:02 

9 

Q. 

Current or former together are ever-smokers. 

09:06:04 

10 


correct? 

09:06:04 

11 

A. 

Yes, that's my understanding. 

09:06:06 

12 

Q. 

And throughout this period, then, what this chart 

09:06:10 

13 


tells us is that more than — something more than 50 

09:06:14 

14 


percent of Minnesotans age 19 and over were 

09:06:18 

15 


ever-smokers? 

09:06:20 

16 

A. 

Yes, that's correct. 

09:06:22 

17 

Q. 

Let's take a look at paragraph 4 at the top of page 

09:07:28 

18 


4. The first sentence says that, "The state and 

09:07:36 

19 


Blue Cross paid substantially more nursing home 

09:07:40 
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20 residence fees for smokers than for never-smokers. 

21 Do you see that? 

22 A. I do see it, yes. 

23 Q. And Dr. Miller has already told us that the word 

24 "Blue Cross" should not have been included, so 

25 we'll just assume that that's out. Okay? 


09 

09 

09 

09 

09 

09 


23 


1 A. 

2 Q. 

3 A. 

4 Q. 

5 

6 

7 A. 

8 Q. 

9 

10 

11 A. 

12 Q. 

13 

14 

15 

16 A. 

17 

18 Q. 

19 

20 
21 

22 A. 


Okay. 09 
Do you agree with that? 09 
That's my understanding. 09 
Okay. Is it your understanding also that that first 09 
sentence relates to the finding of your nursing home 09 
model? 09 
Yes, that's my understanding. 09 
Is it your understanding, also, that that sentence 09 
relates to the same period that we've talked about 09 
in your opening sentence, 1978 to 1996? 09 
Yes, that's my understanding. 09 
And so do I take it you. Dr. Zeger, are telling us 09 
in this sentence that from 1978 to 1996 the state 09 
paid more nursing residence fees for smokers than 09 
for nonsmokers? Is that what you're telling me? 09 
I'm sorry, could you repeat the question once 09 
again? 09 
Are you telling us in this sentence. Dr. Zeger, that 09 
during the period 1978 to 1996, that the state paid 09 
more nursing home residence fees for smokers than 09 
for never-smokers? 09 
This sentence is intended to summarize the results 09 


http://legacy.library.ucsf.@3iJUijft^i^c|(i6§a0QfipBif.industrydocuments.ucsf.edu/docs/rqfl0001 


07 : 44 
07 : 44 
07:46 
07:50 
07:52 
07:56 


07:56 
07:58 
08 : 00 
08 : 08 
08 :12 
08 :14 
08:16 
08:28 
08:34 
08:38 
08 :44 
08:52 
08:56 
09:04 
09:12 
09:14 
09:16 
09:22 
09:36 
09:36 
09:36 
09:44 



23 


of the 

nursing home model. 

24 

Q. 

That' s 

what you already 

told me. 

25 

A. 

The model was estimated 

from NHANES data for a 


09:09:46 

09:09:50 

09:10:00 


24 


1 

2 

3 

4 

5 

6 

7 

8 
9 

10 

11 

12 

13 

14 

15 

16 Q. 

17 

18 

19 

20 
21 
22 

23 A. 

24 Q. 


period 1982 to 1992, I believe, an 11-year period. 
Various probabilities necessary to make a 
calculation analogous to what we do for nonnursing 
home expenditures were calculated from NHANES and 
then applied to state expenditures for this period 
1978 to 1996. 

And based upon the calculations, the 
specific calculations that were made, it was 
determined that a certain fraction of the 
expenditures actually made were attributable to 
smoking. 

I believe that's what this sentence is 
intending to summarize. And in order to answer your 
question specifically, I think it would be necessary 
to, you know, look at the details there. 

I'm sure you will not be surprised to find that I do 
not understand your answer. 

Here's my question again: Is it your 
testimony that during the period 1978 to 1996 that 
the state paid substantially more nursing home 
residence fees for smokers than for never-smokers, 
is that your testimony? 

That's what the sentence says. 

I want to know if it's also your testimony. Is that 
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25 


what you're telling us here today? 


09:11:20 


25 


1 

A. 

I don't have any basis now to discount what this 

09:11:26 

2 


sentence says. 

09:11:26 

3 

Q. 

That's kind of an unusual response. This is your 

09:11:30 

4 


report. 

09:11:30 

5 


Is it your testimony that during the 

09:11:34 

6 


period 1978 to 1996, the state paid substantially 

09:11:40 

7 


more nursing home residence fees for smokers than 

09:11:44 

8 


for never-smokers? 

09:11:44 

9 

A. 

I'm hesitating because this does not include a 

09:12:04 

10 


qualification as to the specific dates. 

09:12:06 

11 

Q. 

You made the qualification that there was some 

09:12:10 

12 


extrapolation; subject to that qualification. 

09:12:12 

13 

A. 

Yes . 

09:12:12 

14 

Q. 

The next sentence states, I quote, "Smokers entering 

09:12:24 

15 


nursing homes during this period were far more 

09:12:26 

16 


likely than never-smokers to be suffering from lung 

09:12:30 

17 


cancer and chronic obstructive pulmonary disease." 

09:12:34 

18 


Do you see that? 

09:12:34 

19 

A. 

Yes. 

09:12:36 

20 

Q. 

Are you familiar with the computation that supports 

09:12:42 

21 


that statement? 

09:12:42 

22 

A. 

I'm generally familiar with it. 

09:12 : 44 

23 

Q. 

What do you mean by generally familiar? 

09:12:48 

24 

A. 

I had a discussion with Dr. Wyant and have briefly 

09:12:56 

25 


reviewed a table that he prepared from which this 

09:13:00 
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26 


1 


sentence was drawn. 

09:13:04 

2 


He was responsible for drafting this 

09:13:06 

3 


sentence and preparing that table, but I have 

09:13:12 

4 


briefly reviewed it. 

09:13:12 

5 

Q. 

And does this also reflect your testimony, this 

09:13:18 

6 


sentence? Do you believe this to be correct? 

09:13:20 

7 

A. 

Yes. 

09:13:20 

8 

Q. 

Did you review this table prior to the writing of 

09:13:24 

9 


this report or did you review it after? 

09:13:26 

10 

A. 

I think both. I think both times. I'm not — I 

09:13:30 

11 


don't recall specifically. 

09:13:32 

12 

Q. 

Paragraph C-6 on page 4 you report the percentage of 

09:14:08 

13 


smoking attributable expenditures for Blue Cross, 

09:14:10 

14 


for GAMC and for Medicaid, do you see that? 

09:14:14 

15 

A. 

Yes. 

09:14:18 

16 

Q. 

And there is a difference between the percentages. 

09:14:26 

17 


14.9 percent for GAMC, 10.7 percent for Blue Cross, 

09:14:32 

18 


and 4.9 percent for Medicaid. 

09:14:34 

19 


Do you see that? 

09:14:34 

20 

A. 

Yes. 

09:14:36 

21 

Q. 

Have you given any consideration to why the 

09:14:40 

22 


attributable percentages should be so different? 

09:14:48 

23 


For instance, why the GAMC percentage should be 

09:14:54 

24 


three times the Medicaid percent? 

09:14:56 

25 

A. 

I've given no specific consideration to that. 

09:15:00 


27 
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1 Q. 

2 A. 

3 Q. 

4 

5 A. 

6 Q. 

7 

8 

9 A. 

10 Q. 

11 A. 

12 

13 

14 

15 

16 

17 

18 

19 

20 Q. 

21 
22 

23 

24 

25 A. 


What would be the explanation for such a result? 

I can only speculate here. 

I understand. I'm just wondering what you would 
attribute it to, whether it's a concern? 

It's not a concern. 

If it's not a concern, then you must have an 
explanation. What is the explanation? In other 
words, you're not startled and horrified? 

Correct. 

So what is the explanation? 

Well, these are different populations of people, 
they're different ages, they have different 
conditions, they're different in many ways. 

And the whole idea of our approach to 
modeling was to allow the smoking attributable 
fractions to depend on things like what disease you 
have and how old you are and so forth. 

And I'm assuming that these populations 
are different in those respects. 

So you're saying that it's — the critical factor 
here is the different populations so that, for 
instance, you are not surprised to see a different 
smoking attributable fraction for a Medicaid 
population? 

These are smoking attributable expenditures. 
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28 


1 Q. Well, that's a fraction. Those aren't dollar 

2 numbers, those are fractions? 

3 A. Yes, but it's the fraction of expenditures. 
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4 

5 

Q. 

All right, you're not surprised to see a 

different — 

09:16:26 

6 

A. 

That's correct, I'm not surprised. 

09:16:26 

7 

Q. 

And that's because the Medicaid population is 

09:16:28 

8 


different from the GAMC population? 

09:16:36 

9 

A. 

I said it may be. You asked me to speculate; I 

09:16:38 

10 


did. I gave you my best speculation. 

09:16:38 

11 

Q. 

It was good speculation. 

09:16:44 

12 

A. 

Thank you. 

09:16:44 

13 

Q. 

On page 5, paragraph 9, there is discussion of the 

09:17:06 

14 


confidence intervals. Is your work on confidence 

09:17:16 

15 


intervals now complete? 

09:17:18 

16 

A. 

I don't know. 

09:17:18 

17 

Q. 

You're not doing it? 

09:17:20 

18 

A. 

That's correct. 

09:17:22 

19 

Q. 

Dr. Leonard Miller is doing it? 

09:17:26 

20 

A. 

No. 

09:17:26 

21 

Q. 

Dr. Wyant is doing it? 

09:17:30 

22 

A. 

Yes . 

09:17:30 

23 

Q. 

I was afraid you were going to say no, and then I 

09:17:32 

24 


was going to not know who to turn to. 

09:17:36 

25 


Now, I may have this wrong, but I believe 

09:17:52 


29 


1 

2 

3 

4 

5 


A. 

Q. 


that Dr. Miller said that he had been doing the work 
on confidence intervals, is that not right? 

I don't know what he said. I don't remember. 

Well, but who historically has been doing the work 
on confidence intervals? 
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7 


8 

9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 

21 Q. 

22 A. 

23 Q. 

24 

25 A. 


Well, the three of us met and discussed the approach 09 
to calculating confidence levels and agreed upon an 09 

approach. And Dr. Wyant and Dr. Miller have been 09 

doing the computations all along. 09 

I believe Dr. Wyant is doing the 09 

computations of confidence intervals for the core 09 

model. And I think Dr. Wyant and Dr. Miller working 09 
together are doing the computations for the refined 09 
model. 09 

But I think it would be fair to say that 09 

Dr. Wyant is sort of trying to get this thing 09 

finished up, leading the effort. 09 

We all, of course, participated in the 09 

design of the approach to calculating confidence 09 

levels. 09 

Is your report significant? 09 

I don't know what you mean. 09 

Well, is it — does it have a confidence interval 09 

that passes through zero or 1 or whatever — 09 

The results are not completed yet. 09 


30 


1 Q. So you do not know at this day whether all or parts 09 

2 of your report pass a test of significance? 09 

3 A. I don't know whether they do; I don't know whether 09 

4 they don't. I don't have any information about 09 

5 that, so I can't answer that question. 09 

6 Q. I understand. Well, suppose a part didn't pass a 09 

7 significance test. You're doing a jackknife, is 09 

8 that correct, or that's what I heard originally? 09 
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A. 


Q. 


A. 

Q. 

A. 

Q. 

A. 


We're using the statistical method of jackknifing, 
yes, to make confidence testing. 

I won't dare ask you what that is, but let me leap 
to the bottom line. 

Suppose that the core model doesn't pass 
the test, that the confidence interval includes one, 
if that's the proper terminology. Would that be the 
proper terminology? What would flunking be? 

Describe to me flunking. 

We're not doing any test, so there's no passing or 
failing. 

You're not doing a test? 

That's correct. 

So we won't come out with something comparable to a 
passes significance at a 5 percent level? 

We're calculating a confidence interval, which is to 
say that we will report an estimate of the smoking 
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1 attributable expenditures for the period 1978 to 

2 1996. 

3 And then we will give an interval of 

4 values that reflect the uncertainty in our overall 

5 estimate. 

6 Q. But you will nowhere produce what I think of as a 

7 standard measure of significance? Won't that be 

8 something that falls out of that project? 

9 A. It is possible in calculating a confidence interval 

10 to perform a statistical test by asking whether a 
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11 

12 

13 Q. 

14 A. 

15 

16 

17 

18 Q. 

19 

20 A. 

21 
22 

23 Q. 

24 

25 


particular value falls within or outside of that 
confidence interval. 

Yes. 

But our plan right now is to calculate that 
confidence interval. And I don't have anything else 
to, you know, to say about whether any particular 
test will be done or not. 

Well, was it ever your plan to do tests of 
significance? 

Tests of significance have been used in some of the 
work that's reported here, but it's not been my 
intention to test any particular hypothesis. 

So you are not going to report the results of those 
tests of significance? 

MR. HAMLIN: Objection; asked and 
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1 


answered. 

09:21:54 

2 


MR. SILFEN: I don't think so, Tom. I 

09:21:56 

3 


mean, if you're making a work product objection — 

09:21:56 

4 


MR. HAMLIN: Asked and answered 

09:21:58 

5 


objection. 

09:21:58 

6 


MR. SILFEN: I didn't know that I had 

09:22:00 

7 


the — unless the answer is no, I perceived the 

09:22:02 

8 


answer to be no. 

09:22:04 

9 

BY 

MR. SILFEN: 


10 

Q. 

I perceive you to be saying you are not going to 

09:22:06 

11 


report a test of significance; is that correct? 

09:22:08 

12 

A. 

That's not what I said. 

09:22:10 

13 

Q. 

Well, then what did you say? 
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14 A. 

15 

16 

17 

18 

19 

20 
21 
22 

23 

24 

25 


What I said is that we are going to report a 
confidence interval, which is an expression of the 
uncertainty in the point estimate of the smoking 
attributable expenditures. 

And then I said that it is possible, 
having received, having obtained a confidence 
interval, for the reader to conduct their own test 
or for the writer to have a test which says, "Does a 
particular value fall within or outside that 
interval?" 

And that would be a valid test of 
hypothesis. But what we plan to report — and what 
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1 we plan to report is a confidence interval for the 

2 smoking attributable expenditures. 

3 Q. And why wouldn't you also take the next step and 

4 report on the significance of the result? 

5 A. The word "significance" is a vague word and it has 

6 many different meanings to different people. You 

7 haven't specified a particular hypothesis. 

8 There are many hypotheses, I mean, that 

9 one might test and report a significance for. So is 

10 there a specific hypothesis that you're referring 

11 to? 

12 Q. In paragraph 11 at the bottom of page 5, you talk 

13 about adjustments that are made for characteristics 

14 of smokers. Do you see that? 

15 A. Paragraph 11 on the bottom of page 5? 
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16 

Q. 

Yes . 



09:24:34 

17 

A. 

Yes . 



09:24:34 

18 

Q. 

And you say at the end of 

that 

paragraph that it is 

09:24:38 

19 


unlikely that any further 

adjustments would 

09:24:44 

20 


materially change our smoking 

attributable 

09:24:46 

21 


expenditure estimates? 



09:24:48 

22 

A. 

Yes. 



09:24:50 

23 

Q. 

Is that your view? 



09:24:52 

24 

A. 

Yes. 



09:24:52 

25 

Q. 

And what is the basis for 

that 

view? 
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1 A. 
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3 Q. 

4 A. 
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10 Q. 

11 

12 A. 

13 

14 
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18 


34 

The Surgeon General's Report in 1989 is one basis 
for it. 

Explain to me why. 

That report has a sentence very much to this effect 
saying that even though there's been an effort to 
look at other explanations for the smoking effect on 
disease and smoking-caused diseases there has been 
very little change in the estimated smoking effects 
from those efforts to do additional control. 

Do you have any other support for that theory, for 
this sentence? 

Yes. In addition to the 1989 Surgeon General's 
Report, there are a few other pieces of evidence 
which we base this on. 

One is the book by Will Manning in which 
he did have available to him information on alcohol, 
the data sets he used, and estimated his models, his 
medical expenditures, controlling for all of the 
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20 
21 
22 

23 

24 

25 


variables he had, including alcohol and exercise, I 
believe, and then not controlling for any of them. 

And his result changed, I think, by 10 or 
15 percent. And that's when he excluded all of the 
variables, such as socioeconomic status and 
everything, changed about 15 percent. So that's the 
second piece of evidence. 
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1 Q. 

2 A. 

3 

4 

5 

6 

7 

8 Q. 

9 

10 A. 

11 
12 

13 

14 

15 

16 

17 

18 

19 

20 


And is there a third? 

Yes, there is a third. The similarity in our 
results from the core model in which we didn't 
control for factors like socioeconomic status and 
education and the refined model where we did control 
for those factors were quite similar to one 
another. And that's a third piece of information. 

Do you have another basis or do we have them all 
now? 

Let's see. Yes, there is one more that comes to 
mind. There's a paper published from the Framingham 
group that I recall reading, let's see if I can 
recollect, I think it's in the "American Journal of 
Medicine" where they looked at the Medicare 
expenditures for their Framingham population and 
looked at a series of risk factors for 
cardiovascular disease, of which smoking was a major 
one. 

There were three, I believe, smoking being 
a major one. And estimated the health care. 
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estimated the smoking effect or these cardiovascular 
risk effect on cardiovascular-related expenditures, 
and then controlled, in addition to that, for 
exercise and alcohol and other things and found none 
of those variables to affect the result. 
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1 Q. Have you done this research, yourself, to establish 

2 these four factors? 

3 MR. HAMLIN: Objection to form. 

4 THE WITNESS: I'm not sure I understand 

5 the question. 

6 BY MR. SILFEN: 


7 Q. 

8 
9 

10 

11 

12 

13 A. 

14 

15 

16 

17 

18 

19 

20 Q. 

21 
22 

23 A. 


You gave me four reasons for the view expressed in 
paragraph 11. 

And what I'm asking you: Is this the 
result of your research, or is this an answer to 
this question that has been supplied to you by other 
members of your group? 

No, it was my answer to your question. Some of that 
work which I've just described was done by me in 
collaboration with Doctors Wyant and Miller, that 
would be the calculation of the core model and the 
refined model. 

The other three citations were citations 
to previous research, not to my research. 

I understand that. But that's previous research 
that you're familiar with personally and you have 
read? 

Yes. 
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1 A 

2 Q 

3 

4 A 

5 Q 

6 

7 A 

8 

9 Q 
10 
11 

12 A 

13 

14 

15 
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17 Q 

18 

19 A 

20 Q 

21 

22 A 

23 Q 

24 

25 A 


And you've read the 1989 Surgeon General's Report or 
the pertinent parts? 
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09:30:38 
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Yes. 

And the 
chapter 
I don't 
Did the 


statement you're referring to is from 
3, is it not? 
recall the chapter. 

work in chapter 3 control for anything at 


all? 

I don't recall the chapter, so I don't know what's 
in chapter 3. 

Did the work in the Surgeon General's Report on the 
risks of smoking control for anything at all, 
anything? 

I don't recall the specifics reported there. The 
sentence I cited was a statement about the effort to 
control. As everything in the Surgeon General's 
Report, it was a reference to work in the 
literature, not to their own work. 

Can you recall what work was referenced in that 
sentence? 

I do not. 

Did you ever look? Did you ever try and find the 
work that's referenced in that sentence? 

All I recall is the sentence at this point. 

So the answer is you don't recall ever looking to 
see if there was anything actually referenced? 

From that particular sentence? 
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1 

Q. 

Yes . 

09:31:36 

2 

A. 

No. 

09:31:38 

3 

Q. 

Had you read Dr. Manning's work apart from your — 

09:31:56 

4 


this task here in Minnesota, or did you read it in 

09:32:00 

5 


connection with this Minnesota work? 

09:32:02 

6 

A. 

I read it in connection with this work. 

09:32:06 

7 

Q. 

So you're familiar with Manning's methodology? 

09:32:10 

8 

A. 

It's been a while, but I'm somewhat familiar. 

09:32:14 

9 

Q. 

Did he define the task the same way that you have? 

09:32:24 

10 

A. 

What task are you referring to? 

09:32:28 

11 

Q. 

The task that you have defined in the first sentence 

09:32:34 

12 


of your report. 

09:32:36 

13 

A. 

I don't think his task was to establish the smoking 

09:32:42 

14 


attributable expenditures for the State of 

09:32:46 

15 


Minnesota, no. 

09:32:46 

16 

Q. 

Well, forget for a moment the State of Minnesota. 

09:32:54 

17 


Was his task to establish the smoking attributable 

09:32:56 

18 


expenditures for the population generally? 

09:33:00 

19 

A. 

I believe the — no, I would say no. 

09:33:08 

20 

Q. 

What was his task? 

09:33:10 

21 

A. 

I believe his main goal was to look at questions of 

09:33:12 

22 


excise taxes and the relationship of taxes to 

09:33:20 

23 


internal and external costs, largely external costs. 

09:33:26 

24 


of smoking. 

09:33:28 

25 

Q. 

I'm not sure that's a distinction. I mean, you're 
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39 
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8 A. 
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Q. 


A. 


Q. 


A. 


A. 


Q. 


right, he was trying to see if smoking pays its way, 
but in order to do that, of course, he had to figure 
out what smoking costs, right? 

He looked at a — yes. 

And the fact that he was looking at external costs 
doesn't distinguish your work here because you're 
also looking at external cost, right? 

I don't recall his specific definition of external 
and internal, so I'm not sure. 

Yes, but the fact is you're looking at external 
costs, too? 

If you define "external costs" for me, I could 
answer that question. 

You don't know what external costs are? You used 
the term first. 

I was trying to refer to the word "external costs" 
as described by Manning, and I don't recall his 
specific definition at this time. 

Well, he surely included costs that were borne by an 
insured group and not by the individual who was 
sick, let's say? That's clearly part of his 
external costs, isn't it? 

The costs borne by an external group were certainly 
part of his external costs? 

Yes. 
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1 A. To the best of my recollection, yes. 


09:34:54 
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2 Q. 

3 A. 
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6 Q. 

7 

8 A. 
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10 

11 

12 Q. 

13 

14 

15 

16 A. 

17 

18 

19 Q. 

20 

21 A. 

22 

23 

24 

25 Q. 


That's what you've been measuring here, right? 

We tried to estimate the fraction of the dollars 
expended by the State of Minnesota that's 
attributable to smoking. 

Tell me what's the difference in his task and yours, 
as he defined it? 

I don't remember his specific task as he defined 
it. I've given you a sense of involving something 
about excise taxes. I don't remember the specifics 
as he's formulated it. 

Okay. On page 6 in paragraphs 14 and 15 you say 
that your analysis goes beyond methodological and 
data limitations of previous analyses. Do you see 
that? 

Let's see, 14 is our analysis goes beyond major 
methodological limitations, and 15 beyond the major 
data limitations, yes. 

What particular prior analyses did you have in 
mind? 

One prior analysis is the model used and reported in 
the MMWR that was done by, I believe, Bartlett and 
Miller and others. And I believe that's one of the 
prior works that's being described here. 

Any other? 
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1 A. I think there's probably also reference to a 

2 previous work done for the Centers for Disease 

3 Control, also which I think it goes under the name 

4 SAMMEC. 
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Q 
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A 

7 

Q 
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A 
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10 


11 


12 


13 


14 


15 


16 

Q 

17 


18 

A 

19 

Q 

20 


21 


22 

A 

23 

Q 

24 


25 

A 


Why is your work superior to SAMMEC? 

To SAMMEC, sir? 

Yes . 

This is to the best of my understanding. SAMMEC was 
developed using indicators of resource utilization, 
such as hospital visits and the like, as opposed to 
specific expenditure data. 

I think it was based upon the I think the 
1989 health interview survey. Whereas we were able 
to actually use real expenditures for actual 
diseases. That's to the best of my recollection. 
SAMMEC, does SAMMEC control for anything other 
than age and gender? 

I don't recall what else they control for. 

Are you familiar with SAMMEC because of your work 
for Minnesota or because it's just something you're 
familiar with in the field? 

It's through my work on this project. 

You had, however, done some smoking-related studies 
in the past, hadn't you? 

One. 
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1 Q. What was that one? 

2 A. I actually was involved in a study looking at the 

3 price elasticity of tobacco use using state data. 

4 It was actually a project done by student 

5 — the masters of public health students at Johns 

6 Hopkins. And I was, again, involved in helping in 
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14 
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16 

17 

18 

19 

20 
21 
22 

23 

24 

25 


this study. 

Q. That's the only tobacco-related work you had done? 

A. I can't recall any other specific examples. 

Q. I thought I saw some others, but we'll take a look 
later. 

MR. HAMLIN: Tom, when you get to a point, 
I'd like a break. 

MR. SILFEN: This is fine, we can take a 

break now. 

THE VIDEOGRAPHER: We're temporarily going 
off the video record. The time is now 9:39 a.m. 

(Off the record.) 

THE VIDEOGRAPHER: We're back on the video 
record. The time is now 9:46 a.m. 

BY MR. SILFEN: 

Q. All right, continuing the deposition. You mentioned 
the model that was published in the MMWR, correct, 
the Miller/Bartlett model? 

A. Yes. 
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1 Q. Are you familiar with that model? 

2 A. Not very familiar with it. 

3 Q. When did you start, as best you recall, working on 

4 this Minnesota Project? 

5 A. October — I think end of October 1996. 

6 Q. When you arrived, did you then have input in the 

7 shaping of the project? I'm thinking particularly 

8 of your statement earlier this morning that you had 

9 a major role in deciding on the core approach, the 
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three reductions? 


09:47:26 


11 A. Did I have — I'm sorry, repeat the question. 

12 Q. I guess what I'm saying is almost tautological. You 

13 told us that you had a role in deciding on the three 

14 reductions methodology, correct? 

15 A. Yes, I had a role in formulating the core model, 

16 which involves three reductions. 

17 Q. But the three reductions are used in, as you related 

18 in this report, in all your models; isn't that 

19 true? 

20 A. Yes. 

21 Q. So this core structure of three reductions flows 

22 through the whole report, does it not? 

23 A. Correct. 

24 Q. Now, I take it, then, that that three reductions 

25 structure was not in place when you arrived in 
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1 


October of 1996; is that true? 

09:48:16 

2 

A. 

No. 

09:48:18 

3 

Q. 

It was in place? 

09:48:20 

4 

A. 

Aspects of it were in place. 

09:48:22 

5 

Q. 

Well, what was it that you contributed then? 

09:48:30 

6 

A. 

I contributed a number of things, including 

09:48:34 

7 


proposing that we do a core simplified analysis. 

09:48:42 

8 


making the three reductions explicit. 

09:48:48 

9 


Those reductions are inherent in the 

09:48:50 

10 


refined model, although, not made explicit in the 

09:48:58 

11 


way they are during the core model, at least they 

09:49:04 
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12 


were inherent in the calculations that were being 

09:49:06 

13 


done when I arrived, you know, the work 

on the model 

09:49:10 

14 


when I arrived. 


09:49:10 

15 


But one of my contributions was to help 

09:49:12 

16 


make explicit their use. 


09:49:14 

17 

Q. 

And what were your other contributions? 


09:49:20 

18 

A. 

You know, we have worked as a team since 

that date. 

09:49:28 

19 


All decisions made about all models are 

really based 

09:49:32 

20 


upon conversations that we had with each 

other. 

09:49:34 

21 

Q. 

When you arrived, was the model disease 

specific, as 

09:49:46 

22 


it is today? 


09:49:46 

23 

A. 

The refined model? 


09:49:48 

24 

Q. 

Any part of the work that was ongoing. 


09:49:50 

25 

A. 

The work on the refined model, I believe 

, was 

09:49:54 


45 


1 


disease specific by November when I arrived. It's 

09:50:00 

2 


my understanding that Dr. Samet played a large role. 

09:50:06 

3 


and he had arrived, I don't know how much earlier. 

09:50:08 

4 


but somewhat earlier than I had. 

09:50:10 

5 

Q. 

I see. Dr. Samet is also at Johns Hopkins, 

09:50:16 

6 


correct? 

09:50:18 

7 

A. 

Yes . 

09:50:18 

8 

Q. 

Have you worked with him previously? 

09:50:22 

9 

A. 

Yes . 

09:50:24 

10 

Q. 

On what projects? 

09:50:26 

11 

A. 

We collaborate on a project looking at the mortality 

09:50:34 

12 


effects of particulate air pollution. 

09:50:38 

13 

Q. 

I take it you mean this is an ongoing work that you 

09:50:42 

14 


do with him? 

09:50:42 
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15 A 

16 Q 

17 A 

18 Q 

19 A 

20 

21 Q 

22 

23 

24 

25 


1 A 

2 Q 

3 A 

4 Q 

5 

6 A 

7 Q 

8 
9 

10 

11 

12 

13 A 

14 Q 

15 

16 


That's correct. 

Is that the only project you have shared with him? 
To the best of my recollection, yes. 

Who is that work being done for? 

The work is currently funded by the Health Effects 
Institute in Boston. 

Okay. All right. Let's take a look at the three 
reductions, and I am going to look at page 7, and I 
am going to also look at the same time at pages 9 
and 10 where you have an example. 

Are you familiar with that example? 


09:50:42 

09:50:46 

09:50:50 

09:50:52 

09:50:56 

09:51:00 

09:51:06 

09:51:10 

09:51:16 

09:51:20 

09:51:22 
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Yes. 

Did you construct it? 

With Doctors Wyant and Miller. 

I have this picture of the three of you kind of 
huddled around the table? 

We're very close friends. 

Good to hear that. All right. 

The first reduction, "How many smokers?" 

I take it this is take into account the simple fact 
that some of the disease occurs among nonsmokers, 
and the disease incurred by nonsmokers is not part 
of the smoking attributable cost? 

That's correct. 

Okay. And so in the simple example that you have on 
page 9, there are 160 total lung cancers, 20 of them 
are among never-smokers so that you, in the first 
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17 step, would reduce the attributable dollars by the 

18 20, or a proportion constructed of the 20? 

19 A. In the first step we would reduce the expenditures 

20 by the fraction .875, which would be 140 divided by 

21 160, leaving out the 20 nonsmokers. 

22 Q. Thank you. That's a better way to say it. 

23 Now, the second reduction is titled, "How 

24 much extra disease?" Do you see that on page 7? 

25 A. Yes. 


09:52:44 
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1 Q. 

2 A. 

3 

4 

5 

6 

7 

8 
9 

10 

11 

12 

13 

14 

15 Q. 

16 

17 

18 A. 

19 


What do you mean by how much extra disease? 

Well, what that refers to is that even in a 
population of smokers, even in a population — even 
with people who have lung cancer among smokers, that 
we would expect some of them to have had lung cancer 
— well, we would expect a certain baseline rate of 
lung cancer among an otherwise similar population of 
nonsmokers. 

So we don't want to attribute all of the 
lung cancers among the smokers to their smoking, as 
we recognize that a certain fraction occur even 
among similar nonsmokers. 

So this second reduction is attempting to 
set aside those. 

And that baseline is the amount that you assume the 
smoking population would have occurred even had they 
never smoked, as you put it here; is that correct? 

I would word it slightly differently. I would say 
it's the increased rate of lung cancer in the 
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20 


smoking population over what would occur in an 


09:54:54 


21 otherwise similar nonsmoking population. 09:54:58 

22 Some people like to talk about this 09:55:02 

23 hypothetical group not smoking. It's simpler for me 09:55:08 

24 to think about an otherwise similar nonsmoking 09:55:12 

25 populations. 09:55:12 


48 


1 

Q 

2 

A 

3 

Q 

4 

A 

5 

Q 

6 


7 

A 

8 


9 

Q 

10 


11 


12 

A 

13 

Q 

14 


15 


16 

A 

17 

Q 

18 


19 


20 

A 

21 

Q 


What did you mean when you said "baseline"? 

You'd have to — 

You used the word "baseline." 

I don't remember how I used it. 

What do you mean — you said a baseline rate, what 
do you mean by that? 

What I mean is the rate that would occur in an 
otherwise similar group of nonsmokers. 

Are you familiar with the term expected in 
epidemiology, the expected rate, and the observed 
rate, do you know what those are? 

Not as you're using them, no. 

Well, would the baseline rate be the rate that you 
would expect in a population of smokers if they 
weren't smokers? 

I'm sorry, could you repeat that? 

Would the baseline rate be the rate of lung cancer 
that you would expect in a population of smokers if 
they weren't smokers? 

That's not the way I use the word baseline, no. 

So you — do you have a different view? I mean. 
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22 


read a sentence from page 7 here, it's the second 


09:56:20 


23 full sentence, "We calculate how often the smokers 09:56:22 

24 being treated got the disease above and beyond the 09:56:26 

25 rate at which they would have gotten it anyway, had 09:56:30 

49 


1 


they never smoked." 

09:56:30 

2 

A. 

I was clarifying the way I would put that idea. You 

09:56:40 

3 


know, it's a semantic, it's somewhat semantic the 

09:56:44 

4 


difference. I'm not going to — well, I would tend 

09:56:52 

5 


to write it in the way that I stated it, comparing 

09:56:56 

6 


otherwise similar populations. 

09:56:58 

7 


This is another way some would write it. 

09:57:00 

8 


And obviously the person who wrote it here wrote it 

09:57:02 

9 


that way. I don't remember who wrote that 

09:57:04 

10 


sentence. 

09:57:06 

11 

Q. 

Well, but I understood that you were principally 

09:57:08 

12 


involved with the core model and in particular with 

09:57:12 

13 


these reduction steps; isn't that true? 

09:57:14 

14 

A. 

Yes. 

09:57:14 

15 

Q. 

So I take it you closely reviewed this 

09:57:16 

16 


presentation? 

09:57:18 

17 

A. 

Yes. 

09:57:20 

18 

Q. 

In fact. Dr. Miller said you were principally 

09:57:24 

19 


involved with the communication in the presentation 

09:57:26 

20 


of just this, these steps; isn't that correct? 

09:57:30 

21 

A. 

I don't know what he said. I don't recall what he 

09:57:32 

22 


said. 

09:57:32 

23 

Q. 

Look at the final sentence of the paragraph, "We 

09:57:36 

24 


then reduce the total expenditures a second time to 

09:57:38 
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25 


remove the expenditures for diseases that would have 09:57:42 


50 
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2 

A 

3 

Q 
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A 

5 

Q 

6 


7 

A 

8 


9 

Q 

10 


11 

A 

12 


13 

Q 

14 

A 

15 

Q 

16 


17 

A 

18 


19 


20 


21 


22 

Q 

23 


24 


25 

A 


occurred anyway." Do you see that? 

Yes, I do. 

Do you agree with that? 

Yes. 

You said a moment ago that the distinction you were 
drawing was semantic, correct? 

I don't recall exactly what I said, but something to 
that effect, yes. 

Is the distinction you're drawing anything other 
than semantic, or is it just a semantic difference? 
It can be more than semantic in some applications, 
but I would have to look at a specific question. 
Well, what did you have in mind? 

I don't know what you mean by the question. 

You said it could make a difference in some 
applications, and I said what do you have in mind? 
Well, what I had in mind was that if you were to ask 
me a specific — give me a specific example of where 
this, you know, and ask me whether this was more 
than a semantic difference in that example, I would 
be happy to try to answer. 

Well, I assumed that you meant that you could think 
of circumstances in which it would be more than 
semantic. That's my question. Dr. Zeger. 

Yes, I could think of circumstances. 
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1 

Q. 

And what are they? 

09:59:22 

2 

A. 

There may well be circumstances where one might 

09:59:28 

3 


imagine the world as it was and how it might have 

09:59:32 

4 


been where it's so difficult to formulate how it 

09:59:38 

5 


might have been that it becomes very important to 

09:59:42 

6 


make the distinction between describing the 

09:59:46 

7 


difference between how the world was and how it was 

09:59:48 

8 


in a similar group of people who didn't smoke versus 

09:59:52 

9 


how it might have in some hypothetical situation. 

09:59:58 

10 

Q. 

You better explain that further. I'm not sure what 

10:00:00 

11 


you're talking about. 

10:00:02 

12 


You're saying there are some circumstances 

10:00:04 

13 


where it might be very difficult. What 

10:00:04 

14 


circumstance? What do you have in mind? You 

10:00:08 

15 


obviously have something in mind here, tell us. 

10:00:10 

16 

A. 

I don't have anything else in mind, no. I was 

10:00:12 

17 


trying to be responsive to your question is there 

10:00:14 

18 


ever more than a semantic difference, and I said 

10:00:16 

19 


yes. 

10:00:18 

20 

Q. 

You said something about a — well, let's read back 

10:00:20 

21 


the answer, the witness's answer. 

10:00:22 

22 


(The requested portion read back.) 

10:01:08 

23 

BY 

MR. SILFEN: 


24 

Q. 

So you said there may well be circumstances where 

10:01:10 

25 


one might imagine the world as it was and how it 

10:01:12 


52 
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1 


might have been where it's so difficult to formulate 


10:01:14 


2 how it might have been, it becomes very important to 10:01:18 

3 make the distinction? 10:01:18 


4 

A 

5 


6 


7 

Q 

8 


9 

A 

10 


11 


12 


13 


14 


15 


16 


17 


18 

Q 

19 

A 

20 


21 

Q 

22 


23 


24 

A 

25 

Q 


To make the semantic — I said that the distinction 
would be more than semantic in that case is what I 
was trying to say. 

What case is that? You said one might imagine, what 
do you imagine? 

I could imagine a situation where you talked about 
the difference between intelligence math ability for 
boys and girls, and you could imagine comparing 
otherwise like boys and girls, or you can imagine 
having those boys go back and become girls, and then 
think of the world that way. 

But that seems more than a semantic 
difference because I don't know how to make boys 
into girls. 

Do you know how to make smokers into nonsmokers? 

I'm not really the best person to talk about that. 

I would say Dr. Samet might be better than me. 

I guess I'm not — I'm missing something here. On 
page 7, the construct that you've created, the 
second reduction says, "How much extra disease?" 
Right. 

And it must be — if it's extra disease, it must be 
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1 excess disease, right, excess disease? 

2 A. Okay. 



3 Q. 


And that means that it is disease that would not 


10:02:40 


4 have occurred in some other circumstance, doesn't 10:02:44 

5 it? It's got to be excess over something. 10:02:48 

6 A. What I've tried to say in this line of questioning, 10:02:54 

7 my answers to your questions, is that what we have 10:02:58 

8 done is to calculate the rate of cancer among 10:03:04 

9 smokers as compared to the rate excess over the rate 10:03:08 


10 of cancer among otherwise similar nonsmokers. 

11 That's what I mean by excess. 

12 Q. And so what you have done is you have defined the 

13 excess as the difference between the smoker's 

14 experience and the experience of a nonsmoker, 

15 holding some other factors equal, correct? 

16 A. Not of a nonsmoker, no, of a population of 

17 nonsmokers. 

18 Q. Right, you've done that on a population basis? 

19 A. Yes, thank you. 

20 Q. So you know how to compare smokers and nonsmokers 

21 and figure the difference, don't you? That's what 

22 you've done? 

23 A. That's what we've done, yes. 

24 Q. And for purposes of that calculation, you assumed, 

25 did you not, that had the smokers not smoked, they 
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1 would have had the disease incidence rate of the 

2 similar nonsmokers in the population? 

3 A. What calculation are you referring to? 

4 Q. This calculation, the excess. 

5 A. Table 2? 



7 


Q. 
A. 

8 

9 Q. 

10 A. 

11 Q. 

12 A. 

13 

14 Q. 

15 

16 

17 

18 

19 

20 
21 
22 

23 

24 

25 


Reduction 2. 

No, that assumption is not necessary to this 
calculation, no. 

It isn't? 

No. 

Explain to me why not. 

Well, what we've done in Table 2 is to compare the 
rate — 

I'm talking about how much extra disease 
calculation, the way you've just described it. 

For purposes of that reduction step, isn't 
it true that you have assumed that had smokers, had 
these people who smoked not smoked, they would have 
had the disease incidence experience of people who 
never smoked? 

MR. HAMLIN: Objection; asked and 

answered. 

THE WITNESS: What I'm telling you is the 
calculation that we did is not based upon that 
assumption. The calculation is based upon — well. 
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1 

2 

3 

4 

5 

6 
7 


that's what I said before. 10:05:50 

BY MR. SILFEN: 


Well, let's look at 

the 

calculation 

there. I 

take 

10:06:18 

it that what you do 

in 

step 2 is you 

see that 

the 

10:06:24 

smokers experienced 

140 

lung cancers 

, correct? 


10:06:32 

In Table 2, the smokers 

had 140 lung 

cancers, 

that' s 

10:06:36 

correct. 
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8 Q. 

9 A. 
10 

11 Q. 

12 A. 

13 

14 

15 

16 

17 Q. 

18 

19 

20 A. 

21 
22 

23 

24 

25 


And how much lung cancers were extra? 10 
Well, what we do — you're talking about in 10 
reduction 2? 10 
Yes. 10 
What we do is we look at the rate of lung cancer 10 
among an otherwise similar group of nonsmokers. 10 
Here 20 out of 5,000. And we take as extra the 10 


difference between the rate for the smokers relative 10 


to the rate for the never-smokers. 10 
And so the smokers experienced 140, the 10 
never-smokers experienced 20, and how many are 10 
excess or extra? 10 
Well, let's see, it says there are 120 attributable 10 
lung cancers out of 140. And the 120 is arrived at 10 
by noticing that there are 20 out of 5,000 in the 10 
never-smoker group. 10 


And if you apply that, if you say how much 10 
higher is the rate for the smokers than for the 10 


56 


1 never-smokers, you would get a reduction that gets 

2 you down from 140 to 120. 

3 Q. I understand. But isn't what you're saying that 20 

4 of the cases among the smokers would have been 

5 expected to occur even if these people had not 

6 smoked? 

7 A. Say it one more time. I'm sorry, I don't mean to 

8 split hairs with you. 

9 MR. SILFEN: Why don't you read it back. 

10 (The requested portion read back.) 
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11 


THE WITNESS: So long as you use 

the word 

10:08:30 

12 


"expectation" in a precise statistical sense, I 

10:08:34 

13 


would agree with that. 


10:08:36 

14 

BY 

MR. SILFEN: 



15 

Q. 

And that's fine, but what do you mean by in 

a 

10:08:44 

16 


precise statistical sense? 


10:08:44 

17 

A. 

Well, the word "expectation" means on average over. 

10:08:48 

18 


you know, population of possible events and 


10:08:52 

19 


basically would get me back to a model that 

compares 

10:08:54 

20 


the smokers to the never-smokers. 


10:08:56 

21 

Q. 

Very good. 


10:08:56 

22 

A. 

Population of smokers to population of 


10:09:00 

23 


never-smokers. 


10:09:00 

24 

Q. 

Very good. I'm tempted to point out to you 

that at 

10:09:16 

25 


the beginning of the discussion I asked if 

we could 

10:09:18 
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1 

2 

3 

4 A. 

5 

6 

7 

8 Q. 

9 A. 

10 Q. 

11 
12 


do this in terms of expected and observed and you 
seemed to not want to do it that way. We muddled 
around for a bit. 

I apologize for muddling around. And you said 
expected and observed. In the epidemiological 
sense, epidemiologists use those words very loosely 
sometimes, and I wasn't willing to do so. 

But now we've used them to your satisfaction? 

Yes. 

Now, on page 10 you say at the top, second sentence 
of the first full paragraph, that the two reductions 
that we've just discussed is an example of the 
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13 

14 

15 A 

16 Q 

17 

18 A 

19 Q 

20 

21 A 

22 

23 Q 

24 

25 A 


1 Q 

2 

3 

4 

5 

6 

7 A 

8 

9 Q 
10 
11 
12 

13 

14 

15 A 


calculation of attributable risk, a standard 
approach in epidemiology. Do you see that? 

Yes, I do. 

In fact, aren't these two steps simply what's 
commonly known as the attributable risk formula? 

Yes . 

Okay. And that goes way back in time to an article 
by Levin, or are you familiar with that? 

I'm familiar with the reference to Levin. I've not 
read his article recently or ever. 

I first saw it in a book by the Lillienfelds. Have 
I got the name right? 

Lillienfeld, yes. 


10:10:26 

10:10:30 

10:10:30 

10:10:38 

10:10:42 

10:10:42 

10:10:48 

10:10:52 

10 : 11:00 

10 : 11:02 

10:11:04 

10:11:06 

10:11:08 
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When I was trying to understand this stuff. And the 
Lillienfelds refers to the attributable risk formula 
as the maximum percentage that could be 
attributable. 

Do you know why they referred to it that 

way? 

I don't know specifically why they referred to it 
that way, no. 

And I think, just to make sure we're talking about 
the same thing, I think of the attributable risk 
formula as being the prevalence of the exposure 
times the quantity relative risk minus one over the 
prevalence of the exposure times quantity R minus 
one plus one. Is that familiar to you? 

It's familiar to me, yes. 


10:11:14 

10 : 11:20 

10:11:24 

10:11:26 

10:11:26 

10:11:28 

10:11:32 

10:11:32 

10:11:44 

10:11:46 

10:11:52 

10:11:56 

10 : 12:00 

10 : 12:02 

10:12:06 


http://legacy.library.ucsf.@3iJUijft^i^c|(i6§a0QfipBif.industrydocuments.ucsf.edu/docs/rqfl0001 



16 

Q. 

And is that what you meant, as well, when you said 

10:12:14 

17 


this was the standard attributable risk formula? 

10:12:18 

18 

A. 

I didn't say it was the standard attributable risk 

10 :12:22 

19 


formula. I said — the application of these 

10 :12:26 

20 


reductions is an example of the calculation of 

10 :12:28 

21 


attributable risk. 

10:12:30 

22 

Q. 

I intended to ask a more precise question. In fact. 

10:12:32 

23 


these two steps taken together are the attributable 

10:12:34 

24 


risk formula, which I just related, they are 

10:12:38 

25 


mathematically the same thing; isn't that correct? 

10:12:40 


59 


1 A. 

2 Q. 

3 

4 A. 

5 Q. 

6 A. 

7 

8 Q. 

9 

10 A. 

11 

12 Q. 

13 

14 A. 

15 Q. 

16 
17 


That's correct. 

And isn't it also correct that that's SAMMEC? Isn't 
that just SAMMEC? 

SAMMEC uses attributable risk. 

That same formula? 

SAMMEC uses the attributable risk in its method, 
yes. 

And when you say attributable risk, that is the same 
formula we just discussed, right? 

SAMMEC uses that formula for the attributable risk 
in its calculations. 

So so far have we done anything other than what 
SAMMEC does? 

When you say so far, you're referring to? 

These first two steps, you haven't controlled for 
anything, you've just applied the attributable risk 
f o rmu1a. 


10:12:40 

10:12:54 

10:12:54 

10:13:00 

10:13:02 

10:13:06 

10:13:06 

10:13:10 

10:13:12 

10:13:14 

10:13:16 

10:13:26 

10:13:26 

10:13:32 

10:13:36 

10:13:40 

10:13:40 
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18 A 

19 

20 
21 
22 

23 Q 

24 

25 


1 

2 A 

3 Q 

4 

5 

6 

7 A 

8 Q 

9 A 

10 Q 

11 A 

12 

13 

14 Q 

15 

16 A 

17 Q 

18 A 

19 Q 

20 A 


So far we've not talked about any application of 
these ideas. So far we have a hypothetical example 
and we're trying to illustrate the two reductions 
which you and I have talked about which comprise 
attributable risk. 

Right. In the hypothetical example we've controlled 
for no other factors and we've just applied the 
attributable risk formula. That's all we've done. 


10:13:42 

10:13:44 

10:13:48 

10:13:58 

10:13:58 

10:13:58 

10:14:00 

10:14:00 
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right? 

Correct. 

That is also the way the core model works, isn't it, 
for the first two reduction steps? You control for 
nothing but age and gender and you apply the 
attributable risk formula? 

Age and gender and insurance status. 

Okay. 

And disease, disease group. 

As an outcome, disease is your — 

Well, we do the calculation separately for lung 
cancer and COPD, and we do it again separately for 
the other major smoking attributable diseases. 

And in these first two steps, are you modeling 
expenditure? 

No. 

You're modeling incidence, correct? 

No. 

What would you call it? 

What would I call what? 


10:14:00 

10:14:04 

10:14:06 

10:14:10 

10:14:12 

10:14:16 

10:14:18 

10:14:30 

10:14:32 

10:14:36 

10:14:40 

10:14:42 

10:14:46 

10:14:48 

10:14:48 

10:14:54 

10:14:54 

10:14:56 

10:15:00 
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21 Q. What is it you're modeling if it's not expenditure 

22 and it's not disease incidence, what is it? 

23 A. Well, in these first two steps we're estimating the 

24 probability of smoking, given you have a particular 

25 disease, as well as the probability of having that 


10:15:04 

10:15:06 

10:15:08 

10:15:12 

10:15:18 


61 


1 


disease, having that disease given you're a smoker 

10:15:22 

2 


and given you're a nonsmoker. Those are the things 

10:15:24 

3 


we're calculating in these two steps. 


4 

Q. 

The two steps taken together gets you an 

10:15:28 

5 


attributable fraction, correct? 

10:15:30 

6 

A. 

Gets you an attributable fraction, that's correct. 

10:15: 34 

7 

Q. 

And that attributable fraction is a fraction based 

10:15:40 

8 


on comparative expenditure between smokers and 

10 :15:44 

9 


nonsmoker? 

10 :15:44 

10 

A. 

No. 

10 :15:44 

11 

Q. 

It is based on comparative what? 

10:15:48 

12 

A. 

Rates of disease. 

10:15:50 

13 

Q. 

And is that different to your view than incidence of 

10:15: 54 

14 


disease? 

10:15:54 

15 

A. 

Yes. 

10:15: 54 

16 

Q. 

What is the difference between the term rates of 

10:15:58 

17 


disease and incidence of disease? 

10:16:00 

18 

A. 

The word "incidence" has a — has units of inverse 

10:16:06 

19 


time. It's the rate occurrence of new disease per 

10:16:12 

20 


unit time, per unit time, whereas we're not making 

10:16:18 

21 


that calculation here. Calculations we're using are 

10:16:24 

22 


more commonly called prevalence, not incidence. 

10:16:26 
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23 Q. I see. Actually, I — you are correct, and I was 

24 just thinking of incidence and the common sense of 

25 an event. 


10:16:30 

10:16:34 

10:16:36 
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1 


But what you're saying is that the term 

10:16:38 

2 


incidence to you is a time based calculation which 

10:16:44 

3 


is opposed to prevalence; that's just what you 

10:16:50 

4 


said? 

10:16:50 

5 

A. 

Yes. 

10:16:52 

6 

Q. 

And what you're doing here is a prevalence 

10:16:56 

7 


computation? 

10:16:56 

8 

A. 

Correct. 

10:16:58 

9 

Q. 

Is that true of all aspects of the model, of the 

10:17:02 

10 


report, is it all prevalence? 

10:17: 04 

11 

A. 

Yes. To the best of my recollection at the moment 

10:17:14 

12 


of all the different things we do. 

10:17:16 

13 

Q. 

I think that's — well, I won't answer the 

10:17:26 

14 


question. I'll stick with your answer. 

10:17:28 

15 


Maybe you should define some terms for me 

10:17:34 

16 


so that we're talking the same language. 

10:17:42 

17 


Cross-sectional/longitudinal, do they have different 

10:17:52 

18 


meanings than prevalence/incidence? 

10:17:54 

19 

A. 

Yes. 

10:17:56 

20 

Q. 

Okay. Can you describe for me the difference 

10:18:02 

21 


between the two pairs of terms? 

10:18:04 

22 

A. 

I could give you a brief definition of each of the 

10:18:12 

23 


terms, if that would start us down the road. 

10:18:14 

24 

Q. 

You could also do a long lecture, I'm sure, about 

10:18:16 

25 


that, but do the brief description. 

10:18:18 
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1 

A. 

A cross-sectional study is one where you look at the 

10:18:22 

2 


occurrence of disease or study of disease. You look 

10:18:26 

3 


at the occurrence of disease at a fixed point in 

10:18:30 

4 


time. In the longitudinal study, one follows people 

10:18:34 

5 


over time and observes the occurrence of disease 

10:18:38 

6 


over time. 

10:18:42 

7 


Cross-sectional studies are typically used 

10:18:44 

8 


to estimate the prevalence of disease. Longitudinal 

10:18:50 

9 


studies can be used to either estimate the 

10:18:52 

10 


prevalence of disease or the incidence of disease. 

10:18:54 

11 


and are often used to do both. 

10:18:58 

12 

Q. 

What about a cohort study? Where would that fit 

10:19:06 

13 


into all those terms? 

10:19:08 

14 

A. 

A cohort study is an example of a longitudinal study 

10:19:12 

15 


where a population of people is followed through 

10:19:14 

16 


time. 

10:19:14 

17 

Q. 

Are there particular problems for which one or the 

10:19:22 

18 


other of those types of studies are appropriate? 

10:19:26 

19 

A. 

As I said, it's possible to estimate prevalence from 

10:19:38 

20 


a longitudinal study or from a cross-sectional 

10:19:40 

21 


study. 

10:19:42 

22 


So if you had a problem where you wanted 

10:19:44 

23 


to estimate prevalence, you could use either sort of 

10:19:46 

24 


study. 

10:19:48 

25 


If you wanted to estimate incidence, you 

10:19:50 


http://legacyJibrary.ucsf.@3ijuijft^i^c|(i6§a0<2fei(sreif.industrydocuments.ucsf.edu/docs/rqfl0001 



64 


1 


wouldn't be able to estimate incidence without 

10:19:54 

2 


auxiliary information from a cross-sectional study. 

10:19:56 

3 


and so in that case a longitudinal study would be 

10:20:00 

4 


necessary. 

10:20:00 

5 

Q. 

And that would also be a cohort study? 

10:20:02 

6 

A. 

You could do so from a cohort study, yes. 

10:20:08 

7 

Q. 

And that was helpful, but what I was actually 

10:20:12 

8 


looking for is: Are there types of problems for 

10:20:16 

9 


which a longitudinal or incidence approach is more 

10:20:22 

10 


appropriate and types of problems for which a 

10:20:26 

11 


cross-sectional or prevalence approach is more 

10:20:30 

12 


appropriate? 

10:20:30 

13 


MR. HAMLIN: Objection to form. 

10:20:34 

14 


THE WITNESS: Yeah, I've not made this 

10:20:36 

15 


pairing of incidence and longitudinal and prevalence 

10:20:38 

16 


and cross-sectional. 

10:20:38 

17 

BY 

MR. SILFEN: 


18 

Q. 

I will unpair them if that will help. 

10:20:42 

19 

A. 

Why don't you ask it again. 

10:20:42 

20 

Q. 

Are there particular — and if I choose the wrong 

10:20:46 

21 


pair, just tell me. We started with incidence and 

10:20:50 

22 


prevalence, so I'll go back to them. 

10:20:52 

23 


Are there particular kinds of problems for 

10:20:54 

24 


which an incidence study is more appropriate and 

10:20:58 

25 


particular kinds of problems for which a prevalence 

10:21:00 


65 


1 study is more appropriate? 10:21:02 
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2 


MR. HAMLIN: Objection to form. 

10:21:06 

3 


THE WITNESS: Well, incidence, the 

10:21:08 

4 


incidence of disease is a quantity that you may be 

10:21:10 

5 


interested in. If you're interested in incidence. 

10:21:14 

6 


then you should do a study that enables you to 

10:21:16 

7 


estimate incidence. 

10:21:18 

8 


If you're interested in prevalence, you 

10:21:20 

9 


should do a study that enables you to estimate 

10:21:22 

10 


prevalence. 

10:21:22 

11 


For example, you can do a cross-sectional 

10:21:24 

12 


study or a longitudinal study. If you're interested 

10:21:28 

13 


in incidence, you should do a study that enables you 

10:21:30 

14 


to estimate that. So it depends what the scientific 

10:21:32 

15 


question is. 

10:21:32 

16 

BY 

MR. SILFEN: 


17 

Q. 

That's fair. But what I was getting at is whether 

10:21:36 

18 


there are particular kinds of questions which are 

10:21:40 

19 


ordinarily approached through incidence or more 

10:21:44 

20 


readily approached through incidence and other kinds 

10:21:48 

21 


of problems that are more readily approached through 

10:21:50 

22 


prevalence? 

10:21:52 

23 


MR. HAMLIN: Objection to form. 

10:21:54 

24 


THE WITNESS: Yes. 

10:21:58 

25 

BY 

MR. SILFEN: 



1 Q. Could you tell me what they are? 10:22:00 

2 A. Yes. If you want to know the incidence of disease, 10:22:06 

3 then you should do a study that enables you to 10:22:10 
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4 


estimate incidence. 


10 : 22:10 


I mean, if your question is what is the 


10 : 22:12 


incidence of lung cancer, then you should do an 


10:22:16 


incidence study. 


10:22:18 


Q. What is that? I think what you're saying is 


10 : 22:20 


probably more enlightening to you than it is to me. 10:22:22 


To me it sounds like the same thing over. 


10:22:24 


I was actually looking for an example of a 10:22:26 


type of problem to which you would kind of 


10:22:30 


instinctively say, well. I'll do an incidence 


10:22:32 


study. 


10:22:34 


But what do you mean when you say if 


10:22:36 


you're interested in the incidence of lung cancer. 


10:22:38 


what does that mean? If I'm interested in the 


10:22:40 


incidence of lung cancer, what is it I'm interested 10:22:42 


in? Tell me. 


10:22:42 


MR. HAMLIN: Objection to form. 


10:22:44 


THE WITNESS: Well, you're interested in 10:22:48 


the rate of occurrence of new cases of lung cancer 


10:22:52 


per unit time, per unit time per person. 


10:22:56 


24 BY MR. SILFEN: 


25 Q. The rate of occurrence of new cases. And describe 


10:23:08 


the other side of the coin then. Prevalence I would 10:23:12 


be interested in what? 


10:23:12 


3 A. In the fraction of persons that have lung cancer at 10:23:18 


a given time. 


10:23:18 


5 Q. And that would refer to a past exposure? It's the 


10:23:24 


burden of past accrual of the disease at the present 10:23:28 
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7 


8 A. 

9 

10 

11 

12 

13 

14 Q. 

15 

16 

17 

18 A. 

19 

20 Q. 

21 
22 

23 A. 

24 

25 Q. 


point in time? 

Well, it's the fraction of people at the present 
point in time who have that disease. Different 
diseases have different mechanisms. 

But it would be, you know, the number of 
people per population unit who have the disease at a 
particular time. 

Okay. Now, back to my original question. Can you 
give me examples of problems, for instance, problems 
you've addressed, that called for an incidence 
approach? 

I'm trying to think of what I've done in my career. 
It's sort of murky right now. 

I share with you. I'm feeling that my career is 
also murky right now. We're in the same murky 
boat. We can commiserate at lunch. 

I can give an example where prevalence is the 
focus. Would that be helpful? 

I'd really like an example of both. Let me — I 


10:23:30 

10:23:34 

10:23:38 

10:23:38 

10:23:42 

10:23:44 

10:23:46 

10:23:50 

10:23:54 

10:24:04 

10:24:04 

10:24:12 

10:24:16 

10:24:20 

10:24:24 

10:24:26 

10:24:34 

10:24:38 

10:24:44 


1 take it this is an area of expertise for you; is it 

2 not? 

3 A. I wouldn't consider myself, you know, a person with 

4 special expertise in this area. These are basic 

5 terms used in epidemiology. I am a biostatistician 

6 who is knowledgeable about epidemiology. 

7 Q. I didn't — you are the — what is your title? 

8 You're the head of the department of epidemiology at 


10:24:46 

10:24:46 

10:24:54 

10:24:56 

10:25:00 

10:25:04 

10:25:10 

10:25:12 
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9 


Johns Hopkins? 


10:25:12 


10 

A. 

No, I'm not. 

10:25:14 

11 

Q. 

What are you? 

10:25:14 

12 

A. 

I'm a professor and chair of the department of 

10:25:16 

13 


biostatistics at Johns Hopkins University. 

10:25:18 

14 

Q. 

Is there a separate department of epidemiology? 

10:25:22 

15 

A. 

Yes. Dr. John Samet is the chairman of 

10:25:26 

16 


epidemiology. 

10:25:28 

17 

Q. 

I'm sorry. 

10:25:28 

18 

A. 

No problem. 

10:25:30 

19 

Q. 

Yet unless I was also mixing the C.V.s, I believe 

10:25:42 

20 


you've written a book on longitudinal analysis. 

10:25:46 

21 


haven't you? 

10:25:46 

22 

A. 

I've coauthored a book entitled, "The Analysis of 

10:25:50 

23 


Longitudinal Data," yes. 

10:25:52 

24 

Q. 

So unless it's a very, very, very short book, you 

10:25:56 

25 


must have some familiarity with circumstances in 

10:25:58 


69 


1 


which longitudinal data is analyzed and 


10:26:02 

2 


appropriate? 


10:26:02 

3 

A. 

I'm sorry, is analyzed appropriately you 

said? 

10:26:10 

4 

Q. 

Yes. 


10:26:10 

5 

A. 

Yes. 


10:26:10 

6 

Q. 

Now, why don't you give us some examples 

of when 

10:26:14 

7 


longitudinal data has been used and when 

it has been 

10:26:16 

8 


used in an incidence study. 


10:26:18 

9 

A. 

Well, let's see, I've been involved with 

a study 

10:26:26 

10 


looking at children's morbidity, children 

in 

10:26:34 

11 


Indonesia. And you might be interested in the 

10:26:36 
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12 


question, what is the incidence rate for respiratory 

10:26:42 

13 


infection among those children? 

10:26:44 

14 


And you might be interested in asking 

10:26:44 

15 


whether vitamin A deficiency predisposes children to 

10:26:50 

16 


have an increased incidence of respiratory 

10:26:54 

17 


infection. 

10:26:54 

18 


Or you might ask a different question of 

10:26:58 

19 


those same data. You might ask the question, what 

10:27:00 

20 


is the prevalence of respiratory infection? And 

10:27:04 

21 


does vitamin A deficiency predispose children to 

10:27:08 

22 


have a higher prevalence of respiratory infection? 

10:27:12 

23 

Q. 

And, in fact, that's the study referenced in the 

10:27:20 

24 


section you wrote on goodness of fit? 

10:27:22 

25 

A. 

It's actually a different study, but a similar 

10:27:24 


70 


1 


problem, yes. 

10:27:26 

2 

Q. 

So I may just end up repeating what you just said. 

10:27:46 

3 


which means I will probably garble it, but I want to 

10:27:50 

4 


get it straight. 

10:27:52 

5 


You said that if you were interested in 

10:27:56 

6 


the occurrence of respiratory disease among persons 

10:28:02 

7 


with a vitamin A deficiency, that a prevalence 

10:28:10 

8 


study, a snapshot at a point in time, would be 

10:28:14 

9 


appropriate. Did I say that correct? 

10:28:14 

10 

A. 

I didn't say that. 

10:28:16 

11 

Q. 

I understand. I was trying to get — am I correct? 

10:28:20 

12 

A. 

I don't know how to answer the question. I mean. 

10:28:26 

13 


are you — prevalence study, what I said is a 

10:28:28 
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14 

15 

16 

17 

18 

19 

20 
21 
22 

23 

24 

25 


prevalence study may be of interest. 

You might want to know are vitamin A 
deficient children more likely to have at any given 
time or averaged over a period of time more likely 
to have respiratory infection than children who are 
not vitamin A deficient? 

That's a different question. That's a 
prevalence question. It can be studied in a 
longitudinal analysis, as is illustrated in the book 
you referred to. 

Or you could ask a different question. 

You could ask, is the occurrence of a new case of 


10:28:32 

10:28:34 

10:28:40 

10:28:42 

10:28:46 

10:28:48 

10:28:50 

10:28:52 

10:28:56 

10:28:58 

10:29:00 

10:29:02 


71 


1 

2 

3 

4 

5 

6 

7 

8 
9 

10 Q. 

11 
12 

13 

14 A. 

15 

16 


respiratory infection, is that influenced by whether 
or not the child is vitamin A deficient? And those 
are different questions and refer to different 
aspects of the biologic process by which a child has 
respiratory infection. In order to have it, you can 
get it but then you have to keep it. 

So prevalence reflects both the incidence 
of the event but also the duration of time for which 
it occurs. 

Now, if you — in the study you reference in your 
article, the one that I'm familiar with, the 
children or some group of them were receiving 
vitamin A supplements; is that correct? 

There is a paper — yes, in this report we referred 
to a paper by Somer and Zeger in which the example 
was a supplementation trial, yes. 


10:29:06 

10:29:10 

10:29:12 

10:29:16 

10:29:20 

10:29:22 

10:29:26 

10:29:30 

10:29:30 

10:29:36 

10:29:48 

10:29:52 

10:29:54 

10:30:02 

10:30:08 

10:30:12 
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17 Q. And would that have involved longitudinal data? 

18 A. The data that are reported there, to the best of my 

19 recollection, were collected — well, they were 

20 definitely collected in the longitudinal study. But 

21 the data that are reported there would not be an 

22 example of what I would call longitudinal data. 

23 Q. Well, I'm less interested in that. If I wanted to 

24 know, if I administer vitamin A to a group of 

25 children and compare them to a group that does not 
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1 get vitamin A supplement — 

2 A. Right. 

3 Q. — and I want to know the effect of that experiment, 

4 what analysis would I use? 

5 A. It would depend on what the question you're asking 

6 is. What specific question do you want to ask of 

7 those data? 

8 Q. Well, I'm sorry if I'm at a loss. I would assume 

9 that the question would be, does the administration 


10 


of vitamin A supplements change 

or 

affect the 

11 


occurrence of whatever condition 

you're 

studying? 

12 

A. 

Well, 

I'm asking because I need 

to 

know 

what 

13 


condition. 




14 

Q. 

Oh, what is the condition you were 

studying at the 

15 


time? 

Was it mortality? 




16 

A. 

Yes . 





17 

Q. 

Okay. 

Then let's say mortality. 




18 

A. 

Okay. 

Got you. So we're going 

to 

look 

at the 
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19 


mortality and the vitamin A supplementation. 

10:32:00 

20 

Q. 

Yes . 

10:32:02 

21 

A. 

Then if you'll just repeat your question. 

10:32:04 

22 

Q. 

Whether vitamin A supplements have a healthy or an 

10:32:12 

23 


unhealthy or no impact on mortality? 

10:32:14 

24 

A. 

Okay. 

10:32:18 

25 

Q. 

And how would you do that study? 

10:32:20 
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1 A. 

2 

3 

4 

5 

6 

7 

8 
9 

10 

11 

12 

13 

14 Q. 

15 

16 

17 A. 

18 

19 Q. 

20 
21 


Well, the way that study was done is children were 
followed forward for a fixed period of time. I 
believe it was a year. 

At the beginning of the period they were 
randomized to receive either vitamin A or not. And 
the number of children that died in the interval — 
and death is an incident event, not a — well, it's 
prevalent in the sense that you're gone, but it's 
different from respiratory infection in that it only 
happens once. 

So the number that died in the group that 
got vitamin A were compared to the number that died 
in the group that didn't. 

Am I to understand that is an incidence approach? I 
say that only because you used the word incidence 
just now. 

The question of interest was about the rate of 
mortality, which is an incident event, yes. 

I was trying to translate it into the terms you used 
before. If I were interested in — suppose that we 
were considering death from malnutrition. I'm 
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22 

23 

24 

25 


picking that out of the air. 

We would be looking at the new cases of 
malnutrition among the persons who got the 
supplement and the persons who didn't get the 
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1 

2 A. 

3 Q. 

4 A. 

5 

6 Q. 

7 A. 

8 Q. 

9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 A. 

21 
22 
23 


supplement, correct? 

Death or malnutrition? 

Malnutrition. 

So we're now switching away from death and looking 
at — 

Yes. 

— some measure of malnutrition. 

I was trying to hook up with your term "new cases" 
which made sense to me. I understood when we were 
describing incidence, I understood that you were 
saying that there you're looking at the new cases, 
and I was trying to find an example that illustrated 
that. 

And would that be an example if the 
outcome we were interested in was malnutrition and 
we gave one group a vitamin A supplement and another 
group not and we compared the new cases of 
malnutrition in the group that got the supplement 
and the group that didn't? 

That would be an example of an incidence study, yes, 
a study whose purpose was to estimate the incidence, 
the relative incidence rates for vitamin A deficient 
versus vitamin A sufficient children. 
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24 Q. If the question were, as it was there, whether the 10:34:50 

25 vitamin A supplement reduces the occurrence of 10:34:56 


75 


1 

2 

3 A. 

4 Q. 

5 

6 A. 


7 


9 

10 

11 Q. 

12 

13 

14 A. 

15 

16 Q. 

17 A. 

18 

19 Q. 

20 
21 
22 

23 

24 

25 


malnutrition, would it have to be done by an 
incidence study? 

No. 

How could you answer that question without an 
incidence study? 

I could do a cross-sectional study. I could 
randomize children to receive vitamin A or not, and 
then six months, two months later come back and do a 
cross-sectional study and see what the rate, the 
prevalence was of malnutrition in the two groups. 

I see. And that — you referred to death as an 
incidence event. Why do you call that an incidence 
event as opposed to malnutrition? 

Well, death happens once per person, at least so far 
as I know. 

We may all be surprised. 

Yes. Malnutrition can come and go, respiratory 
infection can come and go. 

That's very interesting, and thank you for your 
patience. 

Now, where we were when we got on to 
incidence and prevalence was we were up to the third 
step or the third reduction step. And that is 
described on the top of page 8. 

Here's my question: We've already 
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1 


determined the standard smoking attributable 

10:36:58 

2 


fraction, or as you put it on page 10 the 

10:37:14 

3 


attributable risk. 

10:37:20 

4 


What more is there to do? Why do we need 

10:37:24 

5 


a third step? 

10:37:24 

6 

A. 

Well, it's likely that persons with major smoking 

10:37:36 

7 


attributable diseases will have larger expenditures 

10:37: 42 

8 


than persons without — than otherwise similar 

10:37:46 

9 


persons without those diseases. 

10:37:48 

10 


But it is our expectation that there is 

10:37:56 

11 


some baseline level of expenditure in that otherwise 

10:38:00 

12 


similar population. And, therefore, that we 

10:38:04 

13 


shouldn't attribute all of the expenditures for a 

10:38:08 

14 


person, for a group of people who are smokers, to 

10:38:12 

15 


their smoking since otherwise similar groups also 

10:38:14 

16 


have some expenditures. 

10:38:16 

17 


So we take only the difference, the amount 

10:38:18 

18 


by which the expenditures are increased in the 

10:38:22 

19 


smoking group, and create a fraction which we apply 

10:38:26 

20 


as the third reduction. 

10:38:26 

21 

Q. 

Well, I take it that the point is that even if the 

10:38: 42 

22 


people have not smoked and not gotten a 

10:38:48 

23 


smoking-related disease, you would expect them to 

10:38:50 

24 


have some baseline cost? 

10:38:54 

25 

A. 

That's correct. 

10:38:54 
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1 

Q 

2 


3 


4 


5 

A 

6 

Q 

7 


8 


9 

A 

10 

Q 

11 


12 


13 


14 


15 


16 

A 

17 

Q 

18 


19 


20 

A 

21 

Q 

22 

A 

23 

Q 

24 


25 



Well, isn't what you're doing then trying to 
identify the dollar expenditure that would have 
existed had this person never smoked? Isn't this 
just a part of the same analysis? 

No. 

Well, I'm — would you agree with me if I said 
you're trying to identify the dollar expenditure 
that would exist for a similar nonsmoking person? 

No. 

Well, then I am confused. In this paragraph at the 
top of page 8, the first sentence says, "The smokers 
receiving medical services for diseases attributable 
to their smoking would, on average, have incurred 
some health care cost anyway, even if they had not 
gotten a smoking attributable disease." 

Right. 

So aren't you trying to calculate the costs that 
would have been incurred by a person who is a 
never-smoker without the disease? 

No. 

Well, Dr. Zeger, take a look at your next sentence. 
The one that says we calculate how much greater — 

— the average expenditures are for smokers with a 
smoking attributable disease relative to average 
expenditures of never-smokers without the disease. 


10 : 
10 : 
10 : 
10 : 
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10 : 
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1 A. That's what we do. 10: 

2 Q. And the extra part — well, but aren't you 10: 
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39:56 

39:58 

39:58 

40:08 

40:10 

40:18 

40:20 

40:28 

40:34 

40:36 

40:38 

40:42 

40:44 

40:46 

40:58 

41:16 

41:18 

41:18 

41:28 

41:32 

41:34 

41:38 

41:40 


41:42 

41:56 



3 

4 

5 A. 

6 Q. 

7 

8 
9 

10 A. 

11 
12 

13 Q. 

14 

15 A. 

16 

17 

18 Q. 

19 

20 
21 
22 

23 

24 A. 

25 


identifying a baseline dollar amount that would be 
expended by a never-smoker without the disease? 

No. 

Okay. Well, you mean you do not identify — let's 
take this part. You do identify the average 
expenditures of never-smokers without the disease, 
don't you? 

We estimate the average expenditures for, I'm sorry, 
never-smokers without the disease? Yes, we do do 
that, for a population of such people, yes. 

Is your problem with my articulation that I say 
person rather than group? 

That's one problem I have, yes. I wouldn't know how 
to speculate about a particular person's 
expenditures. 

I didn't mean to — well, then let me say it again. 
On a population-wide average basis, aren't you 
trying to determine here the expenditure that would 
have occurred for a never-smoker without a major 
smoking-related disease, so that that amount will 
not be attributed to smoking? 

I would answer no again. And just so it's 
absolutely clear what we do do, we calculate how 
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10:42:20 

10:42:20 

10:42:24 

10:42:28 

10:42:30 

10:42:34 

10:42:34 

10:42:38 

10:42:40 

10:42:42 

10:42:50 

10:42:54 

10:43:04 

10:43:12 

10:43:16 

10:43:18 

10:43:24 

10:43:28 


79 


1 

2 

3 

4 


much greater the average expenditures are for 
smokers with a smoking attributable disease than for 
the average expenditures for never-smokers without 
the disease. 
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5 

6 

7 

8 

9 Q. 
10 
11 
12 

13 

14 

15 

16 

17 

18 

19 

20 
21 
22 

23 

24 

25 


That's what we attempt to estimate in this 
calculation. We take that in and put it into a 
fraction and apply that fraction to the total 
expenditures for the population. 

Yeah, but doesn't that mean that you are removing 
from the smoking attributable amount the fraction 
that is represented by the average expenditures of 
never-smokers without the disease? 

MR. HAMLIN: Objection to form. 

THE WITNESS: It just went on too long, 

I'm sorry, I lost track. 

MR. SILFEN: We'll read it back. 

(The requested portion read back.) 

MR. HAMLIN: Same objection. 

THE WITNESS: You'll have to read it 
again, I'm sorry, I don't think it was a complete 
statement of what we're doing. 

(The requested portion read back.) 

MR. HAMLIN: Same objection. 

THE WITNESS: We're not removing from the 
smoking attributable amount anything. We're trying 
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1 to calculate the smoking attributable expenditures. 

2 BY MR. SILFEN: 

3 Q. This is called a reduction step. Dr. Zeger. I mean, 

4 are you telling me that your third reduction step 

5 does not remove an amount? 

6 A. Not from the smoking attributable expenditures, it 

7 doesn't remove an amount, no. 
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8 

Q. 

What does it remove it from? 

10:45:08 

9 

A. 

It ends up getting us the smoking attributable 

10:45:12 

10 


expenditures. 

10:45:12 

11 

Q. 

So it isn't a reduction? 

10:45:14 

12 

A. 

It's a reduction, yes, it is a reduction. 

10:45:16 

13 

Q. 

So it's a reduction. And isn't it a reduction which 

10:45:20 

14 


is measured by — well, let's go back to the 

10:45:30 

15 


beginning. 

10:45:30 

16 


Let's say in the population if you're a 

10:45:40 

17 


smoker and you get lung cancer, you spend $1,000 for 

10:45:48 

18 


the treatment of that lung cancer. You with me? 

10:45:52 

19 

A. 

Well, go on, yeah. 

10:45:58 

20 

Q. 

And let's say we find — let me suppose that a 

10:46:08 

21 


nonsmoker, an average nonsmoker without lung cancer 

10:46:12 

22 


or any other smoking-related disease, on the average 

10:46:16 

23 


spends $200 on his medical conditions, correct? You 

10:46:26 

24 


with me? 

10:46:26 

25 

A. 

I hear you. 

10:46:30 
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1 

2 

3 

4 

5 

6 

7 

8 
9 


THE VIDEOGRAPHER: We've got to cut. 

MR. SILFEN: Fine, you can think about 
it. The rest of the question is: We don't want to 
charge smoking with the rest of the $200, that is 
the question, and that is what this is about. Okay, 
break time. 

(A break was taken.) 

THE VIDEOGRAPHER: We're back on the video 
record. This is the second tape in the videotape 
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10 


testimony of Scott Zeger. The time is now 10:54 

10:54:24 

11 


a .m. 

10:54:26 

12 

BY 

MR. SILFEN: 


13 

Q. 

Dr. Zeger, we've been talking about the three 

10:54:32 

14 


reduction steps of the core model, correct? 

10:54:36 

15 

A. 

Correct. 

10:54:36 

16 

Q. 

I want to talk for a moment about the dollar figure 

10:54: 42 

17 


that those reduction steps are applied to. 

10:54:46 

18 

A. 

Okay. 

10:54:48 

19 

Q. 

And I'm going to call it the pot. 

10:54:50 

20 

A. 

Okay. 

10:54:52 

21 

Q. 

Because that's what Peter and I have been saying for 

10:54:58 

22 


months. That's the only way we know how to say it. 

10:55:00 

23 


If you look at page 22 — 

10:55:02 

24 

A. 

Page 22? 

10:55:06 

25 

Q. 

Yes. 

10:55:08 
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1 

A. 

Yes . 

10:55:10 

2 

Q. 

You have in your four indented paragraphs, you have 

10:55:12 

3 


the — well, paragraphs 2, 3 and 4 are the three 

10:55:18 

4 


reduction steps. But what comes before that is what 

10:55:26 

5 


I'm calling the pot that the reduction steps are 

10:55:28 

6 


applied to. 

10:55:28 

7 


And that is, quote, "Total expenditures 

10:55:30 

8 


for persons with smoking attributable disease"; is 

10:55:34 

9 


that correct? 


10 

A. 

Correct. 

10:55:36 

11 

Q. 

Now, when we say "total expenditures for persons 

10:55:40 

12 


with smoking attributable disease, " I take it we do 

10:55: 42 
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13 


not mean only the expenditures for that smoking 

10:55:48 

14 


attributable disease? 


15 

A. 

Correct. 

10:55:52 

16 

Q. 

We mean any expenditures during the period under 

10:55:56 

17 


study by a person with a major smoking attributable 

10:56:04 

18 


disease? 


19 

A. 

It's the expenditures for that person with a major 

10:56:10 

20 


smoking attributable disease in a one-year period. 

10:56:12 

21 

Q. 

And in this case, we're using the term person 

10:56:16 

22 


because we really are talking about individuals? 

10:56:18 

23 

A. 

Well, we're talking about all of the individuals who 

10:56:24 

24 


have a major smoking attributable disease, yes. 

10:56:28 

25 

Q. 

And what that means, for instance, is that if, say. 

10:56:38 
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1 


you're a person with lung cancer or COPD, you with 

10:56:42 

2 


me so far? 

10:56:42 

3 

A. 

Yes . 

10:56:42 

4 

Q. 

You are then going to be treated in your lung 

10:56:48 

5 


cancer/COPD model, correct? In the core or in the 

10:56:58 

6 


refined disease? 

10:56:58 

7 

A. 

You will be included in the expenditures, the total 

10:57:06 

8 


expenditures, for persons who have lung cancer or 

10:57:08 

9 


COPD, correct. 

10:57:10 

10 

Q. 

And, for instance, in the refined disease model. 

10:57:16 

11 


when we do a calculation to figure the smoking 

10:57:20 

12 


attributable risk, that is a calculation that is 

10:57:22 

13 


specific to the lung cancer/COPD persons, correct? 

10:57:26 

14 

A. 

Correct. 

10:57:28 
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15 Q. 

16 

17 

18 A. 

19 Q. 

20 
21 

22 A. 

23 

24 

25 


Now, if I understood your prior answer, the dollars 
that you put in the pot for the lung — let's call 
them the lung cancer persons, can I? 

Okay. 

Would include dollars expended for heart disease and 
stroke and all the other diseases in the CHD/stroke 
category, correct? 

So long as the person had lung cancer or COPD, so 
long as a person had lung cancer or COPD. All of 
that person's expenditures would go into the lung 
cancer/COPD total. 
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1 Q. 

2 

3 

4 

5 A. 

6 Q. 

7 

8 A. 

9 

10 

11 

12 Q. 

13 

14 

15 

16 
17 


So that could be heart disease. It could be 
stroke. It could be laryngeal cancer. It could be 
the diseases that fall under the heading diminished 
health status, as well? 

Correct. 

And it could be car accidents, broken bones, and 
poisoning, too? 

Correct. There may — Dr. Wyant is the person who 
has worked with the billing data. I know there were 
some exclusions, but I don't know specifically. So 
subject to any exclusions that were done, correct. 
Okay. Now, an alternative way to approach this 
would have been to limit the pot to expenditures for 
lung cancer/COPD. 

I'm not asking you if that was feasible, 
I'm just saying in the abstract that is an approach 
which I'm now going to ask you about, okay? 
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18 A. 


(No response.) 


10:59:16 


19 Q. I understand why that's difficult for you. Let me 10:59:20 

20 just ask it this way: Why didn't you simply limit 10:59:22 

21 the pot to dollars spent on lung cancer/COPD? 10:59:26 

22 A. Again, Dr. Wyant would be the person who was most 10:59:30 

23 knowledgeable about the billing data. But so far as 10:59:34 

24 I understand it, in my conversation with him, it's 10:59:38 

25 not possible to disaggregate all of the bills and 10:59:42 
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1 assign them to a particular condition. 

2 And you'd have to really speak to him 

3 further about, you know, why that's not possible or 

4 what the issue is. 

5 But when you take all of the bills for a 

6 particular person who is suffering from lung cancer 

7 or COPD, it's not possible to say this bill is for 

8 their lung cancer but that bill over there is for 

9 something else. 


10 Sometimes I guess there's information 

11 about that, but it's not sufficient to actually make 

12 the split that you're suggesting. 

13 Q. Now, have you done an analysis, or has someone done 

14 it, of the number of dollars that go into the lung 

15 cancer/COPD pot that are not — that are clearly not 

16 lung cancer/COPD dollars? 

17 A. I've not done that analysis, no. 

18 Q. Well — 

19 A. Can I just clarify one thing? I want to clarify the 
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20 


word "pot" 

again, I'm sorry. The 

pot 

is the 

thing 

11:01:16 

21 


before any 

reductions get applied; 

is 

that correct? 

11:01:18 

22 

Q. 

Yes. 





11:01:20 

23 

A. 

No, I've not done that. 




11:01:22 

24 

Q. 

It's funny 

that you want to use such 

a silly 

term. 

11:01:26 

25 


but, okay. 

I'll go with it. 




11:01:28 


86 


1 

A. 

Tricked again. 

11:01:30 

2 

Q. 

Good-natured answer, thank you. 

11:01:36 

3 


The reports of the defense experts in this 

11:01:52 

4 


case, which I know you've looked at briefly. 

11:01:56 

5 


suggests that generally something like 50 percent of 

11:02:02 

6 


the dollars in the pot, whether it's the lung 

11:02:10 

7 


cancer/COPD pot or the heart disease/stroke pot, are 

11:02:12 

8 


not lung cancer/COPD or heart disease/stroke 

11:02:16 

9 


dollars. Did you see that report? 

11:02:18 

10 

A. 

I did not, no. 

11:02:18 

11 

Q. 

Let's assume for a moment that that is accurate or 

11:02:32 

12 


that it's reasonably accurate. Would that be a 

11:02:34 

13 


matter of concern for this computation? 

11:02:36 

14 

A. 

First, I have no, you know, basis for assuming that 

11:02:42 

15 


that is accurate. 

11:02:42 

16 

Q. 

I understand. 

11:02:44 

17 

A. 

But I have no reason to be concerned whether it was 

11:02:48 

18 


or not. 

11:02:48 

19 

Q. 

Okay. Now, to go back to the subject we were 

11:03:08 

20 


talking about a moment ago, which is the third 

11:03:14 

21 


reduction step. 

11:03:28 

22 


Let me ask you this question: Suppose 

11:03:38 
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23 that the dollars in the pot were just lung 

24 cancer/COPD dollars. You with me? 

25 A. This is a hypothetical? 


11:03:46 

11:03:48 

11:03:52 
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1 Q. 

2 

3 

4 

5 A. 

6 Q. 

7 A. 

8 

9 Q. 
10 
11 
12 

13 

14 A. 

15 Q. 

16 A. 

17 

18 

19 Q. 

20 
21 
22 

23 

24 


It is a hypothetical. I'm not arguing that it was 
doable, I'm just saying suppose they were. 

Would the third step, third reduction 
step, have still been necessary? 

Well, let me say in that — 

Take your time. 

Just repeat it for me one more time. I'm sorry, I'm 
trying to understand. 

I tried to work up to it so that we had got 
everything on the table. 

Let's assume that the pot is just 
expenditures on the disease under study, lung 
cancer/COPD, okay? You with me on the premise? 

I disagree with the premise. 

I understand. 

And the only reason I disagree is because the 
premise is not possible in this context. So, yeah, 
that's, anyway — 

Well, you know, the people who have been running the 
SAMMEC for all these years are going to be kind of 
surprised because this is what they do, they take 
their smoking attributable percentage for lung 
cancer and they apply it times dollars spent on lung 
cancer. 
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I'm not 


25 


But let's just assume — 


— don't 11:05:28 


1 


want to argue with you about whether it's possible 

11:05:32 

2 


or not. I want to know why the third step was 

11:05:34 

3 


necessary. That's the point of my question. 

11:05:36 

4 


So think of it as an intellectual 

11:05:42 

5 


exercise, and the point is the third step. It is 

11:05:46 

6 


not whether or not you can collect the dollars in 

11:05:48 

7 


the pot differently. Okay? Are you at least 

11:05:54 

8 


hearing me? 

11:05:54 

9 

A. 

I hear you. 

11:05:56 

10 

Q. 

That's good. Very small comfort. 

11:05:58 

11 

A. 

I'm worried if I don't communicate that I hear you. 

11:06:04 

12 


but I hear you. 

11:06:04 

13 

Q. 

My question is this: Had the pot been just lung 

11:06:10 

14 


cancer dollars, would the third step reduction still 

11:06:14 

15 


be necessary? 

11:06:16 

16 


MR. HAMLIN: Objection to foundation. 

11:06:18 

17 


THE WITNESS: It's not possible to — in 

11:06:28 

18 


the problem that we worked on, it's not possible to 

11:06:34 

19 


create a pool that has only lung cancer dollars. 

11:06:38 

20 


which is what made the third reduction necessary. 

11:06:44 

21 

BY 

MR. SILFEN: 


22 

Q. 

Okay. Let me tell you what was on my mind. I could 

11:07:06 

23 


think of two reasons for doing the third step. One 

11:07:10 

24 


is that we've put dollars into the pot that are not 

11:07:20 

25 


disease specific and now we have to take some of 

11:07:22 
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1 

2 

3 

4 

5 

6 

7 

8 
9 


10 


11 

A 

12 

Q 

13 


14 


15 


16 


17 


18 

A 

19 


20 

Q 

21 


22 

A 

23 

Q 

24 


25 



them out. That's one reason. 

And I take it that you have just affirmed 
that that is the reason for the third step. I'm not 
going to ask you to say yes because it doesn't 
really matter. 

I could think of another reason for the 
third step, okay. Suppose the purpose of the 
calculation were to put the state in the position it 
would have been had people not smoked or smoked 
differently. Okay? Are you hearing me, as we say? 
Yep. 

Then I can see the third step also, because what you 
would be saying is if these people didn't get lung 
cancer, they would still have had some medical 
condition and it would have cost the state money and 
we have to take that into account. 

Do you understand what I'm saying? 

I don't know if you've asked a question. I hear 
what you're saying. Is there a specific question? 
Was that latter analysis that I just went through 
part of your thinking in doing the third step? 

Would you repeat the latter analysis again? 

No. Maybe we'll come back to it. Let's just affirm 
a couple of things that I think will probably be 
easy. 
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1 


In the core model, the first two reduction 


11:09:28 


2 steps are done based on NMES, correct? 

3 A. The probabilities used in the first two steps are 

4 estimated from the NMES data, yes. 

5 Q. And as we said before, as you said before, the 

6 factors controlled for or taken into account are 

7 age, gender, insurance status, and disease? 

8 A. Correct. And also I think based also whether 

9 someone was in — there's sort of an adjustment for 

10 nursing home, as well. 

11 Q. I don't know what that is, but — 

12 A. It's in a footnote, I think. 

13 Q. Now, when you say you controlled for insurance 

14 status, what do you mean by that? How was that 

15 done? 

16 A. To the best of my recollection, we distinguished 

17 people who received their health care from their 

18 employer or their union from those who were on 

19 public assistance in the application. 

20 Q. Why did you do that? 

21 A. We were attempting to apply the core model to three 

22 pots, the Medicaid pot, the GAMC pot, and the Blue 

23 Cross/Blue Shield pot, and so we made our 

24 calculations specific to those pots. 

25 Q. And that is, as you stated earlier, I take it. 
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1 because you thought that the results for the 

2 relevant populations might be different? 

3 A. We had the information and tried to make a 



4 


calculation specific to these groups as defined by 

11:12:20 

5 


the variables I indicated, yes. 

11:12:22 

6 

Q. 

Well, if that's so, why did you not examine these 

11:12:34 

7 


separate populations in the refined model, the 

11:12:42 

8 


diminished health status model, and the nursing home 

11:12:46 

9 


model? 

11:12:46 

10 

A. 

I'm sorry, I don't understand your question. Why 

11:12:50 

11 


did we not examine these populations? 

11:12:54 

12 

Q. 

These separate populations. Why didn't you control 

11:12:56 

13 


for insurance status? 

11:12:58 

14 

A. 

In some applications we did control for insurance 

11:13:00 

15 


status. 

11:13:00 

16 

Q. 

What application? 

11:13:02 

17 

A. 

I'd have to look at the specific equations. I don't 

11:13:06 

18 


recall exactly. 

11:13:08 

19 

Q. 

You don't recall any, can't recall any? 

11:13:14 

20 

A. 

I recall the insurance status was a variable in some 

11:13:18 

21 


of the regressions. I don't recall which of them it 

11:13:22 

22 


was a variable. 

11:13:22 

23 

Q. 

Well, it was a variable, but you did not limit the 

11:13:26 

24 


analysis to that population? In the core model, you 

11:13:32 

25 


limited your analysis to the public aid population. 

11:13:34 


92 


1 correct? 

2 A. No, we did an analysis for Medicaid, GAMC, and for 

3 Blue Cross/Blue Shield. We did not limit our 

4 analysis to the public. 

5 Q. I have a feeling that you know what I meant. When 


11:13:36 
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11:13:48 

11:13:50 


http://legacy.library.ucsf.@3iJUijft^i^c|(i6§a0QfipBif.industrydocuments.ucsf.edu/docs/rqfl0001 



you were looking for the smoking attributable 


11:13:52 


7 


fraction for the Medicaid or public aid population. 

11:13:56 

8 


you did your analysis of just the public aid 

11:13:58 

9 


population, correct? 

11:14:00 

10 

A. 

Yes. 

11:14:00 

11 

Q. 

Why didn't you — 

11:14:02 

12 

A. 

To the best of my recollection. 

11:14:04 

13 

Q. 

Why didn't you do that in the refined model, the 

11:14:08 

14 


diminished health status model, or the nursing home 

11:14:10 

15 


model? 

11:14:10 

16 

A. 

And my answer, which I think I gave previously, was 

11:14:18 

17 


that to the best of my knowledge we did account for 

11:14:20 

18 


insurance in those — in some of those models that 

11:14:24 

19 


you have just referred to. 

11:14:26 

20 


I don't know if we've referred to — if we 

11:14:30 

21 


included in all of the models, I'd have to look back 

11:14:32 

22 


at the results. 

11:14:32 

23 

Q. 

Do you consider that you have accounted for 

11:14:38 

24 


insurance status when you have it as a variable in a 

11:14:40 

25 


regression? 

11:14:42 
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1 A. You are accounting for it, yes. 

2 Q. Are you accounting for it in the same way that you 

3 are when you do your entire analysis in the Medicaid 

4 population, for instance? 

5 A. I don't think I understand your question. 

6 Q. Well, it's this: If you — you take what you did in 

7 the core model? 

8 A. Right. 



9 

Q. 

Where you did your analysis of conditional 

11:15:16 

10 


probabilities, if you want to call it that, on a 

11:15:20 

11 


Medicaid population. Okay? 

11:15:22 

12 

A. 

Okay. 

11:15:22 

13 

Q. 

And let's suppose that in the refined model you do a 

11:15:28 

14 


regression on the whole NMES population, all 24,000 

11:15:36 

15 


people, and you have insurance as a covariate. Do 

11:15:40 

16 


you consider those to be the same thing? 

11:15:42 

17 

A. 

No, but the core model was never intended to be the 

11:15:46 

18 


same as the refined model. They're not the same 

11:15:48 

19 


thing. 

11:15:48 

20 

Q. 

Just to make sure, I think you answered this 

11:16:22 

21 


question, but let me make sure. Let's suppose that 

11:16:24 

22 


you did a regression on just a Medicaid population. 

11:16:32 

23 


okay? 

11:16:32 

24 

A. 

Yes. 

11:16:34 

25 

Q. 

And then you did a regression on the whole NMES 

11:16:40 


94 


1 population, all 24,000, and had an insurance 

2 covariate. Do you consider those to be the same? 

3 A. No, they're not the same. One is a regression on a 

4 subset of the people in the other. 

5 Q. I took that to be your answer, but he didn't think 

6 so. Forced us to do a second round. 

7 THE WITNESS: He's a statistician, though, 

8 right? 

9 MR. SILFEN: Oh, oh. Does that mean he 

10 was right? 
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11 


THE WITNESS: 


I don't know. 


12 BY MR. SILFEN: 

13 Q. Well, actually, it then comes back to the same 

14 question. I mean, your answer was, well, they're 

15 not the same model. 

16 But if the Medicaid population, for 

17 instance, was the proper population in which to 

18 figure the Medicaid smoking attributable fraction, 

19 why didn't you do the refined model on the Medicaid 

20 population? 

21 A. I wouldn't agree that, you know, I mean, you 

22 premised it by saying it was the right analysis to 

23 do it. I would not agree with that. 

24 One major difference between the refined 

25 model and the core model is that the refined model 


11 

11 

11 

11 

11 

11 

11 

11 

11 

11 

11 

11 

11 


1 

2 

3 

4 

5 

6 Q. 

7 

8 

9 A. 
10 
11 
12 
13 
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includes many other variables by which the Medicaid 11 

population may well be different than the Blue 11 

Cross/Blue Shield population. 11 

For example, measures of socioeconomic 11 

status and the like. 11 

Your first answer to me, though, is they weren't 11 

intended to be the same model. What was your 11 

intention in doing a core model? 11 


The intention in creating the core model was to make 11 


a simple and more easily understood calculation that 11 
could be compared with the results from the more 11 
complex refined model as a validity check, so we 11 
would feel more comfortable that the refined model 11 
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14 


wasn't missing something or didn't need additional 

11:18:52 

15 


work. 

11:18:54 

16 


And to make clear for ourselves the 

11:18:58 

17 


components of the refined model, that is, the three 

11:19:04 

18 


reductions inherent in that model. 

11:19:08 

19 

Q. 

If you want to know the smoking attributable 

11:19:32 

20 


fraction for a Medicaid group, wouldn't it always be 

11:19:40 

21 


preferable to take your measurement in a Medicaid 

11:19:44 

22 


group? 

11:19:44 

23 

A. 

What measurement are you referring to? 

11:19:48 

24 

Q. 

The measurement of the relative incidence of expense 

11:19:54 

25 


between smokers and nonsmokers? 

11:19:56 


96 


1 

A. 

No. 

11:19:56 

2 

Q. 

It wouldn't be preferable? 

11:20:00 

3 

A. 

You asked would it always be preferable, and the 

11:20:04 

4 


answer is no. 

11:20:06 

5 

Q. 

Would it be preferable in this case? 

11:20:08 

6 

A. 

Not necessarily. 

11:20:10 

7 

Q. 

Well, under what circumstances would it not be 

11:20:16 

8 


preferable to measure the relative health care 

11:20:20 

9 


utilization of smokers and nonsmokers for a Medicaid 

11:20:24 

10 


population in a Medicaid population? 

11:20:26 

11 

A. 

When there was limited information about the 

11:20:32 

12 


Medicaid population and useful relevant information 

11:20:36 

13 


in an otherwise similar population from which you 

11:20:38 

14 


can borrow strength. 

11:20:40 

15 

Q. 

And where is it that you think there was limited 

11:20:42 
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16 


17 A. 

18 Q. 

19 

20 

21 A. 

22 

23 

24 

25 


information about the Medicaid population? 

You were asking me, I thought, a hypothetical. 

Well, I was. But does the hypothetical have any 
application here? Is it your view that you had 
limited information on the Medicaid population? 

I used the word limited to mean less than is 
available if you use a bigger population. 

Yes, that is the case here. We have more 
information available than just for those people who 
were on Medicaid. We have information on some 
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1 

2 Q. 

3 

4 A. 

5 

6 Q. 

7 


9 

10 

11 

12 

13 

14 

15 

16 

17 

18 


A. 

Q. 

A. 


Q. 


A. 


30,000 people. 

So it's not information, it's the number of people, 
sample size you're talking about? 

Well, sample size as applied to, you know, it's more 
than just sample size. 

Well, but are you suggesting that you had more 
substantive information available for persons not on 
Medicaid than for persons on Medicaid? 

No, I wasn't suggesting that. 

Then what are you saying other than sample size? 

What I'm saying is that in statistical estimation, 
it is often better to fit a model that uses more 
information on a broader group of people in order to 
make an estimate for a particular group. 

I don't know what you mean by more information. Do 
you mean more quantity or different information? 
Well, for example, if I wanted to know the 
probability of having lung cancer among smokers and 
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19 

20 
21 
22 

23 

24 

25 


I had two smokers in this room, it would be unwise 
for me to try — to ask them whether they had lung 
cancer, and then to make my estimate for the State 
of Minnesota, for example, from people from this 
room. 

Whereas, if I had additional information, 
say from NMES, even for the entire country, that 
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1 

2 

3 Q. 

4 

5 

6 A. 

7 

8 
9 

10 

11 

12 

13 

14 

15 

16 Q. 

17 

18 

19 A. 

20 Q. 


might be a better way to go about estimating the 
rate of — 

Am I hearing something other than the number of 
people, the sample size? I think I'm hearing the 
sample size? 

In the context of estimating a simple rate, you can 
interpret what I said strictly in terms of sample 
size. 

In the context of a regression analysis, 
the amount of additional information that comes from 
having more people depends — is different depending 
on what their covariate values are, is different 
from what the predictor variables are. 

Certainly, I'm referring largely to sample 

size. 

I want to get something absolutely clear, that your 
calculations in your regressions are based on values 
from NMES, right? 

Correct. 

And the values from NMES are the same for the 
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21 


Medicaid subpopulation and for the rest of the 

11:23:44 

22 


population, right? 

11:23:44 

23 

A. 

You say — 

11:23:46 

24 

Q. 

The information available. 

11:23:50 

25 

A. 

Well, the information available, the variables which 

11:23:54 
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are available, may be the same. But the values that 
those variables take, for example, the age 
distribution, or the gender distribution, may be 
quite different for one population versus another. 
Okay. Well, for reasons other than sample size, I 
mean, if they're different because it's a Medicaid 
subpopulation you want it to be different, don't 
you? 

Isn't that exactly why you want to do it 
on a Medicaid subpopulation? 

I don't want it to be one way or the other. I don't 
have a — I mean, I have no expectation about that. 
Were you a part of discussions about whether or not 
the refined model and the diminished health status 
model and the nursing home model should be done in 
insurance subgroups? 

The models would be the — I don't recall specific 
discussions. 

Okay. Now let's talk about the refined model. When 
I was working with Dr. Miller on the refined model, 
it was sometimes useful to use a paper that he had 
prepared. It was marked as Defendants' Exhibit 2324 
at the Miller deposition. 
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24 

25 


Do you want to take a look at that and 11:26:32 

tell me if you recognize it or if perhaps you even 11:26:34 


100 


1 


reviewed it when you looked at the Miller 

11:26:36 

2 


deposition? 

11:26:38 

3 

A. 

I did not review it when I looked at his 

11:26:40 

4 


deposition. 

11:26:42 

5 

Q. 

Had you seen it before then? 

11:26:44 

6 

A. 

I've seen a document that has some of this in it. I 

11:26:50 

7 


don't know if it's this one in particular. 

11:26:52 

8 

Q. 

The important question would be this: On page 13 of 

11:27:02 

9 


this document, which is Leonard Miller 2324, we see 

11:27:12 

10 


what was identified by Dr. Miller as equation 3.1. 

11:27 :18 

11 


Do you see that in the middle of the 

11:27:20 

12 


page? 

11:27:20 

13 

A. 

I do. 

11:27:22 

14 

Q. 

And he identified that as the — the major portion 

11:27:30 

15 


of the ultimate SAF formula in the refined disease 

11:27:38 

16 


model. 

11:27:40 

17 


Do you recognize it as that? Or does it 

11:27:44 

18 


look to you like it looks to me, a mess? 

11:27:46 

19 

A. 

It looks like the correct calculation to make to 

11:27:52 

20 


calculate the smoking attributable fraction. 

11:27:56 

21 

Q. 

Good, because that will allow us to work through 

11:28:04 

22 


that. 

11:28:04 

23 


Let me ask you to go back again to the 

11:28:06 

24 


pot, though. We agreed that the — well, by the 

11:28:24 

25 


way, let me start this way: The pot total for the 

11:28:28 
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1 


refined model is calculated the same way as it was 

11:28:34 

2 


in the core model, correct? 

11:28:34 

3 

A. 

For the specific diseases? 

11:28:38 

4 

Q. 

Yes. For lung cancer and COPD, you identified 

11:28:42 

5 


people who had those conditions and then for them 

11:28:44 

6 


you took all their costs incurred in a year? 

11:28:48 

7 

A. 

Correct. 

11:28:48 

8 

Q. 

And we agreed previously that for a lung cancer 

11:28:58 

9 


person that that could include costs for CHD, if he 

11:29:02 

10 


had it during that year, for other cancers, for 

11:29:04 

11 


diminished health status, and maybe even for car 

11:29:10 

12 


accidents. 

11:29:12 

13 


That's the conversation we had previously. 

11:29:14 

14 


correct? 

11:29:14 

15 

A. 

And for going to the doctor to — 

11:29:18 

16 

Q. 

For a hangnail? 

11:29:18 

17 

A. 

For anything, yeah. 

11:29:20 

18 

Q. 

Now, when we come to the third reduction step in the 

11:29:26 

19 


refined model, what we're going to subtract is the 

11:29:32 

20 


expected expense for a nonsmoker without a major 

11:29:40 

21 


smoking-related disease, correct? 

11:29:42 

22 

A. 

Are you referring to a particular place in the 

11:29:46 

23 


report or into this equation? 

11:29:48 

24 

Q. 

I'm referring to the principle of the third 

11:29:50 

25 


reduction. 

11:29:52 
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1 A. 

2 

3 

4 

5 

6 

7 

8 
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10 


11 

Q 

12 


13 

A 

14 

Q 

15 


16 


17 

A 

18 

Q 

19 

A 

20 

Q 

21 

A 

22 


23 


24 

Q 

25 

A 


Actually, the second and third reductions are 
combined in the refined model equation 3.1, in this 
document you've handed me, splits the SAF into two 
pieces, the first piece which corresponds to the 
first reduction as was done in the core model. 

And then a second piece that corresponds 
to the second and third reductions taken together. 

I wouldn't be able — I don't think I could 
disaggregate that third reduction based upon the 
simple quantity — 

You have laid down a weighty challenge here. Now 
we're going to go through this equation. 

Great. 

Here we go. Okay. You've agreed that equation 3.1 
appears to be the correct formula. Can you 
translate it for me into narrative English? 

Yes . 

Okay. Let's do it. 

Would you like me to narrate it? 

Yes, I would like you to do it. 

Well, actually equation 3.1 is the expected smoking 
attributable expenditures, not the smoking 
attributable fraction. 

It lacks the denominator? 

Right. 
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1 Q. We'll put that aside for the moment. 11:32:00 
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2 A. 


Okay. Well, it has two pieces. 


The first piece is 


11:32:06 


3 the rate of smoking among people with the disease. 11:32:16 

4 Here written, Prob, open parenthesis, [smokei=l, 11:32:24 

5 vertical line, open curly bracket {currtrdi=l}], 11:32:26 

6 unclose curly bracket, close bracket. 


7 


That first piece before the star 

11:32:32 

8 


represents in narrative English the fraction of 

11:32:36 

9 


people among the fraction of people with the disease 

11:32:40 

10 


who are smokers. 

11:32:42 

11 

Q. 

Okay. That's the first reduction step? 

11:32:46 

12 

A. 

That is the first reduction step as laid out in the 

11:32:48 

13 


core model. 

11:32:48 

14 

Q. 

All right. 

11:32:52 

15 

A. 

Okay. And then there's a second piece to the 

11:32:56 

16 


equation, which starts with the third line and goes 

11:33:00 

17 


to the end. 

11:33:00 

18 

Q. 

Right. 

11:33:04 

19 

A. 

Okay. And that piece compares the expenditures 

11:33:10 

20 


among people who are — among smokers who are 

11:33:14 

21 


currently treated. That's the first term on the — 

11:33:20 

22 


the term on the third line. 

11:33:22 

23 

Q. 

The expense for a smoker with lung cancer or with a 

11:33:28 

24 


smoking-related disease? 

11:33:28 

25 

A. 

Right. It compares that expense to the expense that 

11:33:32 
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1 would be expected if there were not — m a 

2 population of nonsmokers. 

3 And that expected expense in the 

4 population of nonsmokers has two pieces because in 



5 

6 

7 

8 
9 

10 Q. 

11 
12 

13 

14 

15 A. 

16 

17 

18 

19 

20 
21 
22 

23 

24 

25 


the population of nonsmokers we may have some people 
who have lung cancer. 

So we take their expenditures and we 
average them with the expenditures for people who 
aren't — who don't have lung cancer. 

Tell you what, I was trying to catch up. Can you 
just go back and — I think we agreed that we start 
with the expense for a smoker with lung cancer. 

And you were going to — could you go from 
there? I really got lost, I apologize. 

Yes. You take the average expense for the group of 
smokers with lung cancer and we subtract away a 
quantity. So that quantity is the average 
expenditures for the population of nonsmokers. 

And that average expenditure is calculated 
from two kinds of people, a fraction of them 
actually have lung cancer, and they have — tend to 
have different expenditures than the other group 
which don't have lung cancer. 

And this thing we're subtracting away is a 
weighted average of those two sets of expenditures 
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1 weighted by the fraction of people who actually have 

2 lung cancer in the nonsmokers, appropriately 

3 weighted. 

4 Q. That was excellent. I actually understand it more 

5 clearly than I did before. 

6 We have the first term, which we agreed is 
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7 


the first reduction, right? 


11:35:46 


8 A. The one on the second line, the probability of smoke 11:35:50 

9 given current treated? 11:35:50 


10 

Q 

11 

A 

12 


13 

Q 

14 


15 

A 

16 

Q 

17 


18 


19 


20 


21 


22 

A 

23 

Q 

24 

A 

25 

Q 


Yes. 

Yes, that's the same as the so-called first 
reduction in the core calculation. 

And then we're going to multiply that first 
reduction times a quantity? 

Correct. 

And the quantity — I know I'm repeating in part 
what you said, but I — as my wife will tell you, I 
don't understand anything until I say it. 

The quantity begins with the expense for a 
smoker with the condition, a smoking-related 
disease? 

Correct. 

And now we're going to make two subtractions? 

You could think of it as making two subtractions. 

The reason why I'm going to try and think about it 
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1 that way is I'm going to try and make these fit into 

2 the second and third reductions. 

3 A. Okay. You can do that. 

4 Q. I may learn that I can't, but that's what I was 

5 going to try and do. The first reduction has an 

6 expense term and a weighting or probability term, 

7 correct? 

8 A. Correct. 

9 Q. And the expense term is the average expense for a 



10 


nonsmoker with lung cancer? 

11:37:10 

11 

A. 

Correct. 

11:37:12 

12 

Q. 

And the weighting term is the likelihood of a 

11:37:36 

13 


nonsmoker getting cancer, is that correct, or 

11:37:40 

14 


getting a smoking-related disease? 

11:37:40 

15 

A. 

It's the fraction of — it represents the fraction 

11:37:44 

16 


of nonsmokers that have the disease, correct. 

11:37:48 

17 

Q. 

And that is the same as the lung cancers that you 

11:38:04 

18 


would expect the smokers to get if they didn't 

11:38:10 

19 


smoke, right? 

11:38:12 

20 

A. 

I'm sorry, say that again. 

11:38:16 

21 

Q. 

We've identified here the baseline rate of expected 

11:38:20 

22 


lung cancer among nonsmokers? 

11:38:24 

23 

A. 

Yes, this probability represents the fraction of 

11:38:30 

24 


lung cancers which you tend to get among a 

11:38:34 

25 


population of nonsmokers, right. 

11:38:36 
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1 

Q. 

And that is the same as the proportion of 

11:38:58 

2 


smoking-related disease that we would expect among 

11:39:04 

3 


smokers even if they didn't smoke, correct? 

11:39:06 

4 

A. 

We've had — 

11:39:10 

5 

Q. 

Our semantic problem. 

11:39:12 

6 

A. 

We've had this discussion. 

11:39:12 

7 

Q. 

We have a semantic problem, but semantics aside 

11:39:16 

8 


you'd agree with me? 

11:39:16 

9 

A. 

I would answer it the same way I answered it 

11:39:20 

10 


before. 

11:39:22 

11 

Q. 

All right. Isn't this, in a sense, the how much 

11:39:34 
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13 A. 

14 Q. 

15 A. 

16 

17 

18 

19 

20 
21 
22 

23 

24 Q. 

25 


extra disease reduction? 1 

No. 1 

Why not? 1 

In the core model, the how much extra disease 1 

reduction is the difference between the disease rate 1 
for the smoking population minus the disease rate 1 

for the nonsmoking population divided by the disease 1 

rate for smoking population. 1 

This is one of the terms in that 1 

reduction, but it's not the same as that reduction. 1 

Particularly, it's multiplied by an expenditure, as 1 

well. 1 

The third term is the expense for a nonsmoker 1 

without the condition, correct? 1 


108 


1 A. 

2 Q. 

3 A. 

4 

5 

6 Q. 

7 


8 A. 

9 

10 

11 

12 

13 

14 


The third — let's see the — the last term — 1 

Second subtraction, I'm sorry. 1 

Yes, that has expenditures for persons who are 1 

neither smokers nor currently treated for the 1 

smoking attributable disease. 1 

Well, as you've just described it, why isn't that 1 

the third reduction? 1 

Well, that's a component of the third reduction. 1 

The other component is up in that second piece we 1 

just talked about. 1 

The other component of the third reduction 1 
is the expenditures for people who — I'm sorry, the 1 
expenditures for people who have the disease — 1 

let's see, you've got me confused here. 1 
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15 

Q. 

I'm glad you're confused. That's very comforting. 

11:42:00 

16 

A. 

Yeah. 

11:42:00 

17 

Q. 

Why don't we leave it until after the break. I'd 

11:42:12 

18 


like to think about what you said and maybe it will 

11:42:14 

19 


sink in and I'll see it. 

11:42:16 

20 


It is clear, however, is it not, that in 

11:42:42 

21 


these two subtraction steps, we are accomplishing 

11:42:48 

22 


the reductions described as your second and third 

11:42:58 

23 


reductions? 

11:43:00 

24 

A. 

Yes. 

11:43:02 

25 

Q. 

Good. Now, let's go back to the point we were 

11:43:06 
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1 


working on before. If we go back to your 

11:43:20 

2 


description of the third reduction on page 8, the 

11:43:50 

3 


quantity we're identifying there is the average 

11:43:54 

4 


expenditures of never-smokers without the disease. 

11:43:58 

5 


correct? 

11:43:58 

6 

A. 

We're on page — 

11:44:00 

7 

Q. 

That's at the very end of the how many more 

11:44:06 

8 


dollars. We're going to turn that into a 

11:44:08 

9 


proportion? 

11:44:10 

10 

A. 

We take the difference between the average 

11:44:12 

11 


expenditures for smokers with the disease, which is 

11:44:16 

12 


the first term here. 

11:44:18 

13 

Q. 

Yes . 

11:44:20 

14 

A. 

And we subtract away from that the average 

11:44:22 

15 


expenditures of never-smokers. 

11:44:24 

16 

Q. 

Without — 

11:44:26 
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17 A. Without the disease. 

18 Q. Okay. Now, we agreed previously that when we were 

19 putting our pot together for a person with lung 

20 cancer, forgive me for repeating this, but the 

21 record probably wouldn't read if I didn't, for the 

22 person with lung cancer, that person in that year 

23 may also have CHD, may also have other cancers, may 

24 also have diminished health status, may also have 

25 other costs, car accidents, doctor visits, right. 
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1 

2 A. 

3 Q. 

4 

5 

6 A. 

7 Q. 


9 

10 

11 

12 A. 

13 Q. 

14 A. 

15 

16 Q. 

17 

18 
19 


and they all go into the pot? 

Correct. 

Now, when we identify expenditures, the average 
expenditures expected for a nonsmoker without a 
major disease — 

Right. 

— isn't it correct that we are going to leave in 
that pot the lung cancer/COPD costs, the CHD/stroke 
costs, all of the other major smoking-related 
disease costs, and any diminished health status due 
to smoking? 

That's my understanding, yes. 

Now — 

And that due to smoking modified all the things that 
were left in. 

Yes, I agree. Well, let me back up on that. In a 
way I think that puts the question I'm asking. 

To get to the smoking attributable portion 
for a lung cancer person, you did an analysis for 
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20 

21 

22 

23 A 

24 

25 


1 

2 Q 

3 

4 A 

5 Q 

6 A 

7 

8 Q 

9 

10 A 

11 Q 

12 A 

13 

14 Q 

15 A 

16 Q 

17 

18 

19 

20 A 

21 Q 


your first and second reductions of the relative 11:46:40 


incidence of lung cancer among 
nonsmokers, correct? 

The relative, the rate of lung 
and nonsmokers, as well as the 
among the lung cancer — among 


smokers 

and 

11:46:44 



11:46:44 

cancer 

among smokers 

11:46:52 

fraction of smokers 

11:46:56 

people 

who had lung 

11:47:02 


111 


cancer. 

So those reduction steps which get you to a smoking 
attributable amount are based — 

First two steps. 

— are based on lung cancer? 

Are based upon persons who have, among other things 
that they might have, lung cancer and COPD. 

I know, but the percentage is based on the number of 
smokers among the people with lung cancer? 

Who have at least lung cancer? 

Yes. 

And COPD, right, or COPD, and may have other 
things? 

Right. 

Right. 

Well, then, doesn't that mean that you are going to 
be applying first and second reduction steps that 
are based on lung cancer to dollars that are spent 
on heart disease and diminished health status? 

That's a possibility. 

Is that something that you fellows discussed? 
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A. 

We have discussed it. 




11:48:02 

23 

Q. 

And I 

may be running into 

a 

work product 

objection 

11:48:10 

24 


here. 

but is — what have 

you discussed? 


11:48:12 

25 

A. 

Well, 

we discussed what the 

options were. 

The 

11:48:22 
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options were laid out and we tried to identify the 
best strategy, the fairest strategy, given the 
options, and this is what we did. 

I take it you're talking about conversations that 
occurred before the report was put in? 

Some of them occurred before and some of them 
occurred subsequently. 

What were the options that were discussed? 

Well, we were unable to separate the expenditures 
for a person, for example, who has both lung cancer 
and suffered a stroke. 

It's hard for us to separate out how many 
dollars go to the lung cancer and to the stroke, so 
far as I understand from Dr. Wyant. 

So we could have taken all of those, and 
I'm using that as an example. I don't know that 
particular case. 

I understand. 

Let me, in fact, correct myself. We have a person 
who has lung cancer and some other smoking 
attributable disease. It's difficult to sort out 
there expenditures as this one is due to that and 
this one is due to that. So we had to take them as 
a lump. 
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So we could either allocate those 
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expenditures to the lung cancer case or we could 
allocate them to the other pot. Those were our two 
choices. 

And, of course, whatever we did we would 
do the same thing to be fair, do the same thing with 
NMES calculations, as well. 

And so this was the choice we made. One 
factor — well, this was the choice we made. 

Have you done a sensitivity analysis to — well, one 
I asked you about in the beginning was to see how 
many dollars we were talking about. You have not 
done that? 

I have not done that, no. 

And since you haven't done that, I take it you also 
haven't done an analysis to see what the effect may 
be of applying the lung cancer reduction step to 
heart disease dollars. You have no idea what that 
effect is? 

I've not done that calculation, no. 

Wouldn't it be pretty clear that — I'm going to do 
this A and B. If, A, the nonlung cancer dollars are 
sufficiently high, and, B, the difference in the 
lung cancer and heart disease and diminished health 
status reduction steps is sufficiently high that the 
result has been skewed? 
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1 

A. 

No, I don't agree. 

11:51:16 

2 

Q. 

Understanding that you don't have to accept my A and 

11:51:18 

3 


B, why wouldn't that be true if my A and B are 

11:51:22 

4 


right? 

11:51:22 

5 

A. 

Well, one reason is that the health care costs for 

11:51:28 

6 


having disease B alone and the health care costs for 

11:51:36 

7 


having — well, let me back up. 

11:51:40 

8 


The health care costs of having disease A 

11:51:42 

9 


and B together may be very different than the sum of 

11:51:46 

10 


the health care costs of having A or having B. 

11:51:52 

11 


So, for example, if you're trying to treat 

11:51:54 

12 


a person with heart disease, but that person also 

11:51:58 

13 


has lung cancer, the costs of their heart disease 

11:52:02 

14 


may be very much larger. 

11:52:06 

15 


In fact. Dr. Samet has suggested maybe 

11:52:10 

16 


very much larger than would be their costs had they 

11:52:12 

17 


not had lung cancer. And those additional costs are 

11:52:14 

18 


appropriately attributed to the — to smoking. 

11:52:18 

19 

Q. 

Well, that — 

11:52:20 

20 

A. 

To the extent to which their lung cancer — 

11:52:22 

21 

Q. 

Which they're attributed to the lung cancer? 

11:52:24 

22 

A. 

Yes. 

11:52:26 

23 

Q. 

You could just as easily reason it the other way. 

11:52:28 

24 


though, and I know you're not arguing otherwise. 

11:52:30 

25 

A. 

Yes. So that's one reason why what we're doing is 

11:52:36 
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7 A. 
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13 A. 

14 

15 

16 

17 

18 

19 

20 
21 

22 Q. 

23 

24 

25 


valid. Let me be clear. That's one reason why I 
wouldn't agree with your premise that you had put in 
the question to me. 

If you were doing a journal article on this subject, 
wouldn't you consider it to be incumbent on you to 
measure the impact of this choice that you've made? 
It wouldn't be incumbent upon me, no. In analysis, 
there are many choices that one makes. One does the 
best to look at the impact of the various choices 
and to do sensitivity analyses. 

Perhaps I shouldn't have said incumbent. Wouldn't 
it have been expected and probably preferable? 

Not if — what I was trying to say in my response 
was that you do your best to put in priority the 
things that need to be looked at. 

And, you know, we are looking at things. 
We're continuing to look at things. And we've gone 
for the highest priorities, ones based upon our 
judgments of the ones that would make the biggest 
difference, potentially make the biggest 
difference. 

Now, in the formula that we've just worked our way 
through, I think we have some hangover questions on 
it which I'm going to try and think about at lunch. 

But where I would go next is where I went 
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1 with Dr. Miller, which is to try to understand how 11:55:42 

2 the components of that formula were derived. And 11:55:48 
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3 


the formula seems to me to be made up of two chunks 


11:55:56 


4 


of information. 

11:55:58 

5 


One chunk is the three conditional 

11:56:04 

6 


probabilities and the other chunk is the three 

11:56:10 

7 


expense numbers, smoker with, nonsmoker with and 

11:56:18 

8 


nonsmoker without. 

11:56:20 

9 


Is that a fair shorthand of what's in 

11:56:22 

10 


here? 


11 

A. 

You're referring to the piece that has to do with 

11:56:26 

12 


the second and third reductions? 

11:56:28 

13 

Q. 

Actually, I'm referring to everything. 

11:56:30 

14 

A. 

Everything, sorry. Yes, there are six terms there 

11:56:32 

15 


and you've listed them, yes, that's correct. 

11:56:34 

16 

Q. 

Well, that's the level of math at which I'm really 

11:56:38 

17 


good. 

11:57:18 

18 


Okay. Let's talk about the conditional 

11:57:20 

19 


probabilities. I'm not sure whether we coined that 

11:57:26 

20 


term or you did. Is that a term that makes sense in 

11:57:28 

21 


this context? 

11:57:28 

22 

A. 

Conditional probability is a term used in 

11:57:34 

23 


probability or in statistics. 

11:57:38 

24 

Q. 

I know I didn't coin it. 

11:57:40 

25 

A. 

Somebody must have. 

11:57:40 
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1 Q. Somebody must have said it to me. The probabilities 

2 that we're going to derive, let's take lung 

3 cancer/COPD first because in refined model it's done 

4 differently from lung cancer/COPD and CHD/stroke? 

5 A. Correct. 
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8 A. 
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17 Q. 

18 

19 

20 
21 
22 

23 

24 

25 


Who made the decision, by the way, to use different 
methodologies within the refined model? 

I would actually call it the same methodology. The 
idea of estimating the terms in this equation 3.1. 

A different estimation method was used. The 
approach or the method, the overall method, is the 
same, but the specific approach to estimating these 
unknown probabilities was different, yes. 

Who made that decision? That was based 
upon discussions largely with Dr. Samet. The 
decisions are made, you know, by all of us. 

Okay. Now, I think of it in terms of three 
principal conditional probabilities. Given lung 
cancer, what is the probability that you smoke? And 
given that you smoke, what is the probability of 
lung cancer? And then given that you don't smoke, 
what is the probability of lung cancer? 

I know those aren't exactly what occur in 
the equation, but I think those are the 
probabilities that are derived in order to get to 
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1 the quantities in the equation. 

2 A. I think it's true that those three conditional 

3 probabilities you stated would be sufficient to get 

4 us to the three that appear in this equation. 

5 Although, I'm not — 

6 Q. That is a very congenial thing to say, because I 

7 think that's exactly what I meant to say. That I 
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think those are the three conditional probabilities 


11:59:54 


9 that are sufficient to get you to the quantities in 12:00:00 
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24 

Q 

25 



the equation. That is exactly what I meant to say. 
But we'll check on that as we go along. 

First methodology. For lung cancer, let's 
just take one we agree on, given lung cancer what is 
the probability that you smoke? That we know you 
had to do because that's the first term in the 
equation. 

Correct. 

And the way that is done, it's done in NMES, and 
it's done for age, gender — by age, gender, and 
it's just a simple count. If there are 1,000 lung 
cancers and 900 of them are smokers, it's going to 
be 900 over 1,000, or 90 percent? 

I think the — I think that's correct. 

Okay. And similar for the other conditional 
probabilities, it is simply an age/gender count in 
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1 those? 12:01:04 

2 A. I'm going to just reserve my answer. I can't 12:01:12 

3 remember exactly what the strata were. I think it's 12:01:18 

4 just age and gender, but I don't have a perfect 12:01:20 

5 recollection of that. 12:01:20 

6 But that's right, it's the frequency with 12:01:22 

7 which these events occur in the population as 12:01:26 

8 indicated by the conditioning in these 12:01:30 


9 probabilities. 12:01:32 

10 Q. And for CHD/stroke, these conditional probabilities 12:02:00 
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11 


are derived by a form of regression, correct? 

12:02:06 

12 

A. 

Correct. 

12:02:08 

13 

Q. 

And tell me the name of the form of the regression 

12:02:12 

14 


because I always say it wrong. 

12:02:14 

15 

A. 

Well, it's a probit regression. 

12:02:18 

16 

Q. 

Well, isn't it a binumeral variate joint 

12:02:24 

17 


distribution, maximum likelihood thing, something 

12:02:30 

18 


like that? 

12:02:32 

19 


MR. HAMLIN: Objection to form. 

12:02:32 

20 

BY 

MR. SILFEN: 


21 

Q. 

I could get it right if I wanted to, there's just 

12:02:38 

22 


some things I don't care to learn. I'm too old. 

12:02:40 

23 

A. 

Would you mind repeating the question? No, it was 

12:02:44 

24 


a — the model was what you might call a bivariate 

12:02:50 

25 


model, meaning that it was a model for currently 

12:02:56 
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Q. 


A. 


6 Q. 

7 


9 A. 
10 
11 

12 Q. 


treated and smoking as the two outcomes. It was of 
the probit form. 

So you did a first regression, which is whether you 
are a smoker given your other characteristics? 

No. 

I thought that you did two regressions. In one the 
outcome is whether you're a smoker, and the other is 
whether you're being currently treated? 

Well, it's two regression equations fit together 
because you're estimating the joint distribution of 
these two currently treated and smoking variables. 
Okay. But the two regressions fit together are. 
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13 


first, whether you are a smoker as the outcome as a 

12:03:56 

14 


function of your characteristics, whatever you were 

12:04: 00 

15 


using as your covariates? 

12:04:02 

16 

A. 

Correct. 

12:04: 02 

17 

Q. 

And the second is whether you are currently being 

12:04:06 

18 


treated for CHD/stroke as a function of your other 

12:04:10 

19 


covariates, but not smoking? 

12:04:14 

20 

A. 

Correct. 

12:04:16 

21 

Q. 

And from that, you derive a correlation coefficient 

12:04:20 

22 


between disease and smoking? 

12:04:24 

23 

A. 

Well, just as the regression parameters in those two 

12:04:34 

24 


regressions are parameters of this analysis, so. 

12:04:38 

25 


too, is the correlation coefficient another 

12:04:40 
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parameter. 

It's another thing that we estimate when 
we do the simultaneous bivariate regression. 

It's going to work like the coefficients in the — 
Think of it as another parameter to be estimated 
from the joint information about current treated and 
smoking. 

Now, how was it decided to do this bivariate 
analysis, and why was it decided to do a bivariate 
analysis? 

Well, what we needed to get out of this analysis 
were these probabilities, these three probabilities 
that you've referred to that are in equation 3.1 in 
this document you've given me. 

Right. 
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16 A. The fourth probability is determined from the 

17 others. So really what we needed to do was to 

18 estimate the probabilities in a little two-by-two 

19 table. 

20 And so the way to do that is to fit a 

21 model that estimates those probabilities and allow 

22 them to be a function of other factors which we 

23 wanted to control for. 

24 And that's what — this is sort of a 

25 standard method that's used for that purpose. 
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1 standard statistical technique. 12:06:16 

2 Q. I guess — I understand you answered a question. I 12:06:22 

3 had something different in mind. In NMES — which 12:06:28 

4 is what you're using here, correct? 12:06:32 

5 A. NMES was used to estimate this bivariate model, 12:06:38 

6 yes. 12:06:38 

7 Q. You have smoking information. You have other 12:06:44 

8 covariates. You have incidence of disease. And you 12:06:52 

9 have expenditure for disease. You have all that 12:06:54 


10 information, correct? 

11 A. Correct. 

12 Q. And the outcome you're interested in here is 

13 expenditure for disease, isn't it, ultimately? 

14 A. Well, that's one of the outcomes. 

15 Q. Well, I'm not talking about the methodology you 

16 decided on here, I'm talking about the end of the 

17 day. At the end of the day, you wanted to know the 


12:06:56 
12:06:56 
12:07:06 
12:07:12 
12:07:14 
12:07:18 
12:07:20 
12:07:22 


http://legacy.library.ucsf.@3iJUijft^i^c|(i6§a0QfipBif.industrydocuments.ucsf.edu/docs/rqfl0001 



18 


19 

20 
21 
22 

23 

24 

25 


A. 

Q. 


A. 


expenditures — 

Which were attributable to smoking. 

— which were attributable to smoking. 

Why wouldn't one have specified a model in 
which the outcome was expenditure and the predictors 
were the covariates that you were interested in? 
Because that model wouldn't address the question 
that we've addressed in this study. 
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123 

I think I'm puzzled by that. I don't know why you 
would get a — if smoking were a predictor, you 
would get a coefficient for smoking, the 
contribution of smoking to predicting expenditure, 
correct? 

You would get a regression coefficient for smoking 
if it were a variable in an expenditure model. 

And once with that, you could surely determine a 
smoking attributable fraction, couldn't you? 

I don't know. 

Well, I mean, I can think of lots of easy ways to do 
it, and I'm an English major. For instance, you 
could — you estimate the model on your smokers and 
nonsmokers, you get coefficients, and then you put 
the smokers back in the model but you change only 
one thing, you change their smoking coefficient, 
make them nonsmoker, see the difference, why not? 

You know, I don't know. I mean, I don't know how to 
answer it because I didn't do that, and I've not 
thought carefully about what you're proposing as to 
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what its problems might be. 

It 

' s not something I 

12 

22 

did or 

thought carefully about 

the possibility of 

12 

23 

doing. 




12 

24 Q. 

Yeah, 

but when one looks at 

the 

schematic of what 

12 

25 

you've 

done here, why? Why 

did 

you do all this? 

12 


0 

0 

0 

0 
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4 A. 

5 Q. 
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9 Q. 
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11 A. 

12 Q. 
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14 

15 

16 

17 A. 

18 

19 

20 

21 Q. 

22 


I'm quite serious, why? You had the outcome in 12:0 
NMES, which is expense. You had all the 12:0 
covariates. Why did you do all this? 12:0 
Why did we do the refined model? 12:0 
Why didn't you just build a model to predict the 12:0 
outcome that was of interest here? 12:0 
We did build a model to estimate the smoking 12:1 
attributable fraction of dollars. 12:1 
Why didn't you do it in the most obvious and direct 12:1 
way? 12:1 
We did it in an obvious and direct way. 12:1 
Was it discussed that the information was available 12:1 
to you to estimate a simple model in which the 12:1 
outcome was expense and the covariates were disease 12:1 
and any other variables you were interested in, was 12:1 
that discussed? 12:1 
Our analysis is a regression analysis that estimates 12:1 
the quantities necessary to calculate the smoking 12:1 
attributable fraction. And I'm not going to 12:1 
speculate further about what we — 12:1 
That wasn't my question. My question was whether 12:1 
there was — I'm not asking you to speculate on the 12:1 


9:18 

9:24 

9:24 

9:30 

9:36 


9:40 
9:42 
9:48 
9:52 
9:56 
9:56 
0:04 
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23 


value of another model. 


12:11:08 


24 I'm asking you — really I'm asking you 12:11:10 

25 questions about this model. Why this model? Was it 12:11:12 


125 


1 discussed that you could have specified a very 

2 simple direct model in which the outcome was expense 

3 and the covariates, the predictors, were smoking and 

4 the other variables of interest? Was it discussed? 

5 MR. HAMLIN: Objection; asked and 

6 answered. 

7 MR. SILFEN: No, he hasn't answered. 

8 THE WITNESS: I don't recall a discussion 

9 of the specific model that you've proposed, as I 


10 


understand it 

• 


11 

BY 

MR. SILFEN: 



12 

Q. 

Well, can you 

tell me what is the advantage of your 

13 


model over the one that 

I suggest? 

14 

A. 

I don't know 

because I 

don't know specifically what 

15 


model you are 

suggesting. 

16 


MR. 

SILFEN: 

Let's break for lunch. 

17 


MR. 

HAMLIN: 

Do you mind going until about 

18 


12 : 30 or not? 



19 


MR. 

SILFEN: 

What's up? 

20 


MR. 

HAMLIN: 

Just that we've got lunch 

21 


reservations 

for a little bit later. 

22 


MR. 

SILFEN: 

Tom, I'm going to do this for 

23 


you. My questions will 

get a little weak, but 

24 


you'll go with me, won' 

t you? 

25 


THE 

WITNESS: 

I'll do my best. 
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1 

2 


MR. SILFEN: Let's do it. 

(Discussion held off the written record.) 

12:12: 42 

3 


MR. HAMLIN: This is off the record. 

12:12:46 

4 


THE VIDEOGRAPHER: We are temporarily 

12:12:50 

5 


going off the video record. The time is 12:12 p.m. 

12:13:02 

6 


(Off the record.) 

12:13 : 44 

7 


THE VIDEOGRAPHER: Back on the video 

12:14:14 

8 


record. The time is now 12:14 p.m. 

12:14:16 

9 

BY 

MR. SILFEN: 


10 

Q. 

I had a piece prepared here where I convinced myself 

12:14:22 

11 


and everyone else that you could take those three 

12:14 :24 

12 


conditional probabilities and turn them into the 

12:14 :28 

13 


terms in the final equation, and I think you 

12:14:34 

14 


relieved me of that. 

12:14: 34 

15 


You're pretty sure that you can do that? 

12:14:36 

16 


Those are the sufficient probabilities, the ones we 

12:14:40 

17 


discussed, correct? 

12:14:40 

18 

A. 

I'm not sure because you said them quickly. And I 

12:14 : 44 

19 


think so, but I'm not 100 percent sure. If we wrote 

12:14:48 

20 


them down on a piece of paper, I could tell you for 

12:14:50 

21 


sure. 

12:14: 50 

22 

Q. 

Well, why don't I save — do you want to write them 

12:14:58 

23 


down? 

12:14:58 

24 

A. 

Can I write them as you say them, that would be 

12:15:00 

25 


helpful. 

12:15:00 
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1 

Q. 

We won't belabor it, if you're going to be able 

12:15: 04 

2 


to — given lung cancer, what is the probability 

12:15:08 

3 


that you smoke? Given smoke, what is the 

12:15:14 

4 


probability of lung cancer? And given no smoke. 

12:15:16 

5 


what is the probability of lung cancer? 

12:15:22 

6 


I'd like to see how you write those. He's 

12:15:28 

7 


making a box. Let the record reflect he's making a 

12:15:30 

8 


box. 

12:15:32 

9 

A. 

I believe it's true that these three conditional 

12:15:38 

10 


probabilities would be sufficient for us to 

12:15:40 

11 


calculate the joint distribution of lung cancer and 

12:15:46 

12 


smoking and thereby then calculate the other 

12:15:48 

13 


probabilities listed in this 3.1. 

12:15:50 

14 

Q. 

Thank you. Give me back my pad, but take your piece 

12:15:58 

15 


of paper. 

12:16:00 

16 


You can keep it. I don't care. It 


17 


doesn't look like it would do me any good. 


18 


MR. HAMLIN: Do you want to mark that? 

12:16:06 

19 


MR. SILFEN: No. It took me seven hours 

12:16:08 

20 


to do a proof of that. See this? I did this 

12:16:16 

21 


myself. 

12:16:16 

22 


MR. HAMLIN: Can we mark that, too? 

12:16:26 

23 


MR. SILFEN: I've got 15 minutes to work 


24 


with, we might as well start marking stuff I did. 


25 

BY 

MR. SILFEN: 



128 


1 Q. Okay. We've done the conditional probabilities, now 12:16:32 
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2 


we want to do what I think of as the expense model. 

12:16:36 

3 


And there is an expense regression, correct, in 

12:16:50 

4 


which the outcome is the probability, and outcomes 

12:16:54 

5 


are probability and level of expense? 

12:16:56 

6 

A. 

Correct. 

12:16:58 

7 

Q. 

And what we're talking about there is the 

12:17:02 

8 


probability of any expense? 

12:17:06 

9 

A. 

In the first of the two equations that you 

12:17:10 

10 


mentioned? 

12:17:10 

11 

Q. 

Yeah. 

12:17:12 

12 

A. 

Correct. 

12:17:12 

13 

Q. 

So once again it's not just an expense related to a 

12:17:16 

14 


smoking-related disease, it's any medical expense? 

12:17:20 

15 

A. 

For a person who has that particular disease, yes. 

12:17:26 

16 


so if we're talking about the — well, if we're 

12:17:30 

17 


talking about the CH- — 

12:17:32 

18 

Q. 

I don't think so. Well, then let's get into that. 

12:17:36 

19 


The outcome is — I understand the outcome to be 

12:17:38 

20 


probability of expense, any expense, yes, no? 

12:17:44 

21 

A. 

Yes, that's correct. 

12:17:46 

22 

Q. 

And now what we want to know is the components of 

12:18:00 

23 


that regression. And I understand them to be a 

12:18:14 

24 


group of covariates, things like race and education 

12:18:20 

25 


that vary somewhat, depending on their 

12:18:24 


129 

1 significance? 12:18:24 

2 A. The model, as originally specified, had a set of 12:18:30 

3 variables and there was a reduction of variables 12:18:32 
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4 

5 


6 Q. 

7 


9 A. 

10 Q. 

11 A. 

12 Q. 

13 

14 

15 

16 

17 A. 

18 

19 Q. 

20 A. 

21 Q. 

22 

23 A. 

24 

25 Q. 


based upon their procedure to get to a smaller 
model, yes. 

And then there is a disease term, any 
smoking-related disease, any major smoking-related 
disease, yes, no? 

What model are we talking about? I'm sorry. 

We're talking about the — 

CHD/stroke? 

No, my understanding is there is just one regression 
and it is not disease specific, it's any — the 
variable, the outcome is probability of expense, 
yes, no, or level of expense. And the variable used 
is any smoking-related disease, yes, no. 

But there's a separate modeling effort for the 
diminished health and for the disease modeling. 
Diminished health is separate? 

Not talking about that. 

Not talking about that, talking about the disease 
model. 

So it's for the diseases, and, I'm sorry, ask your 
question again. 

Well, I'm trying to understand the components of 


12:18:36 

12:18:38 

12:18:48 

12:18:52 

12:18:56 

12:19:00 

12:19:04 

12:19:06 

12:19:10 

12:19:14 

12:19:18 

12:19:22 

12:19:28 

12:19:30 

12:19:34 

12:19:36 

12:19:38 

12:19:40 

12:19:40 

12:19:46 

12:19:46 

12:19:50 
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1 

2 

3 

4 

5 

6 


disease regression. I think we 
outcome is probability of expen 
there are covariates in there, 
significance test of the order 
demographics. 

And then I'm talking 


've agreed that 

the 

12:19:54 

se, yes, no, that 

12:19:56 

if they survive 

the 

12:20:02 

of education or 

race. 

12:20:04 



12:20:06 

about a disease 

term. 

12:20:08 
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7 


and I am stating that, as we read the code, the 


12:20:14 


8 disease term is any smoking-related disease, any 12:20:18 

9 major smoking-related disease, yes, no, that it is 12:20:24 


10 


not disease specific? 

12:20:24 

11 

A. 

I'm not sure. I'd have to review. 

12:20:32 

12 

Q. 

Okay. Well, isn't that a somewhat important point 

12:20:38 

13 


because as you've already anticipated, if it's, as I 

12:20:44 

14 


say, any smoking-related disease, that means your 

12:20:46 

15 


expense equation does not distinguish between 

12:20:50 

16 


diseases? 

12:20:50 

17 

A. 

Well, it distinguishes only to the level of the 

12:20:58 

18 


major smoking attributable diseases, if that's 

12:21:00 

19 


true. 


20 

Q. 

And you have no — 

12:21:02 

21 

A. 

I can't remember the specific equation. It's not 

12:21:06 

22 


there right now. 

12:21:06 

23 

Q. 

Well, I understand that, but this strikes me as 

12:21:10 

24 


something a little different than remembering a 

12:21:12 

25 


specific equation. This is a concept. 

12:21:14 
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1 To your knowledge, was your regression 

2 sufficiently sensitive to tell the difference 

3 between expenses for different diseases? 

4 MR. HAMLIN: Objection; asked and 

5 answered. 

6 THE WITNESS: I don't remember. I don't 

7 recall that specifically. 

8 BY MR. SILFEN: 



9 Q. Okay. Now, there is a — if you have the materials 

10 available to check that, you might check it because, 

11 of course, then the questions would be, well, why? 

12 And I can't ask you those questions when you don't 

13 know. 

14 There is also a smoking term. Do you 

15 remember what the smoking term is in the expense 

16 regression? 

17 A. I believe it's the conditional expectation of the 

18 smoking variable as specified in the bivariate 

19 model. I'm not sure. Given the reported, the 

20 result, but I'd have to go back and look at the 

21 equation, again. 

22 Q. Why don't I tell you what I believe it is and then 

23 you can check because, obviously, then I'm going to 

24 ask you why you made certain choices. 

25 I believe that there is no main effect of 


12:21:42 
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12 : 22:00 
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1 smoking in the expense equation. 

2 A. This is the probability of expenditure, correct? 

3 Q. Yes, or level of expenditure, either one. I don't 

4 mean this — I mean this to refer more to my use of 

5 the terms than your knowledge of them. 

6 Do you know what I mean when I say main 

7 effect of smoking? 

8 A. I know — well, I know what a main effect is in a 

9 statistical model. 


10 

Q. 

Okay. 

Again — 

11 

A. 

But I 

don't know what you mean 


12:22:46 
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12 Q. 

13 

14 

15 

16 

17 

18 

19 A. 

20 
21 
22 

23 

24 

25 Q. 


I'm trying to make sure that I'm using the terms 
correctly, not testing your understanding. I know 
you understand it. 

What I think of as a main effect would be 
smoking alone as opposed to smoking in an 
interaction term. I'm drawing that distinction. Is 
that sensible? 

In some contexts, it is sensible. In this 
particular context, what I believe is in the model 
are what you might more usefully think of as two 
main effects. 

One main effect for public and one main 
effect for private insurance. 

That is my understanding. What is in the regression 
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1 

2 

3 A. 

4 Q. 

5 

6 

7 A. 

8 Q. 

9 

10 

11 A. 

12 
13 


are terms for pubsmk or privsmk, depending on the 
insurance status of the person? 

Right. 

Why do we have — well, what do you mean by two main 
effects? This is an interaction term, is it not, 
statistically, pubsmk? 

Yes. 

Well, then, why do you call it two main effects 
rather than two interaction terms? What distinction 
are you drawing? 

Because those two terms are meant to reflect the 
effect of smoking on expenditures where you get — 
you allow for there to be one size effect for public 
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14 

15 

16 

17 

18 

19 Q. 

20 A. 

21 
22 

23 

24 Q. 

25 


expenditures and a different size effect for private 
expenditures. 

Both of them are the effects of smoking, 
but just it allows it to have a different value for 
the two subsets. 

Why are you doing that? 

There's the possibility, as we discussed previously, 
there was the possibility that the effects of 
smoking might be estimated to be different in these 
two subgroups. 

Yeah, but you've already got a disease variable in 
this regression. Take my word for it that the 
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10 

11 A. 

12 

13 

14 Q. 

15 

16 
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variable is any smoking-related disease. Miller 
told me it was. On the second day he came back and 
he affirmed. 

That means that the significance of the 
smoking coefficient here isn't the contribution of 
smoking to the occurrence of disease, it's the 
contribution of smoking to the probability of 
expense over and above the effect of having a 
disease. 

Isn't that right? 

Over and above the effect of controlling for a 
variable that's in that model, which may or may not 
accurately and perfectly reflect the condition. 

But you're agreeing with me that what you're 
measuring here is not the effect of smoking on 
disease and then on expense, because you've already 
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17 

18 A 

19 Q 

20 
21 
22 

23 A 

24 Q 

25 A 


1 Q 

2 A 

3 

4 Q 

5 

6 

7 A 

8 Q 

9 

10 

11 

12 A 

13 

14 

15 Q 

16 A 

17 

18 Q 


got a variable for disease in here? 

I've answered. 

My question is: When you measured the effect of 
smoking on disease in your conditional 
probabilities, you didn't limit it to a Medicaid 
population, right? 

In the — let's see, in the bivariate model? 

Yeah. 

I think that's correct. 
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So — 

Although, I would want to refresh myself to be 
sure. 

Why do you come here in this expense model with the 
effect of smoking on disease already determined and 
put in an interaction term? 

I don't understand your question. 

Well, you've measured the effect of smoking or the 
correlation of smoking with disease already by the 
time you get to this probability of expense 
equation, correct? 

We have, prior to fitting that equation, a model 
which allows us to estimate the conditional 
probabilities that are necessary in equation 3.1. 
Right. 

We do not yet have a model that allows us to 
estimate the expenditures. 

I understand. But in your expenditure regression. 
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19 

20 
21 
22 

23 

24 

25 


A. 

Q. 

A. 


you have a disease term, which is any smoking, major 
smoking-related disease, yes, no. All right, you'll 
accept that? 

I don't have any choice since I don't remember. 

Okay. 

Although, the best answer is I don't remember 
exactly the form of that equation right now, but go 
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1 

2 Q. 

3 

4 

5 A. 

6 

7 

8 

9 Q. 
10 
11 
12 

13 A. 

14 

15 

16 

17 

18 Q. 

19 

20 

21 A. 


ahead. 

Assuming that is the form of the equation, what is 
the interpretation of the coefficient of the pubsmk 
term? 

It's the additional expenditure for — additional 
public expenditures for persons who are smokers, 
conditioned on the reported value that's in the 
equation for currently treated disease. 

My question is: Why would you specify your model so 
that that value is measured in a Medicaid 
population, but the relationship of smoking to 
disease is not? 

Well, it's — one reason is because it's likely that 
expenditures will differ in these two populations 
having to do with demand or access or other sorts of 
things. Whereas, the — well — and that's why that 
is in that equation. 

Is this supposed to be an effect of smoking apart 
from its biological effect? Is that what we're 
measuring here when you use the word demand? 

That could be one possibility. 
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22 

Q. 

What else could be 

in there? 

12:29:34 

23 

A. 

I'm not sure right 

now. 

I actually find it 

12:29:40 

24 


difficult to think 

about 

the equation without seeing 

12:29:42 

25 


it in front of me. 

and I 

will review it at lunch. 

12:29:46 


1 Q. 

2 

3 
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10 A. 

11 
12 

13 

14 

15 

16 Q. 

17 

18 

19 

20 
21 
22 

23 A. 
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Well, I know you have big lunch plans, and I don't 
— we can come back to that tomorrow. I don't want 
to get in the way of lunch. I'm serious, if you can 
get it in mind at lunch, that's fine, otherwise we 
can get it the next day. 

Do you know how — you probably don't 
remember this, either — how the pubsmk or privsmk 
variable was figured, whether it's a 01 or some 
other kind of construct? 

See, I think it was possible for people to shift 
their source of payer. And it may be that it's 
reflecting the fraction of time that — my 
recollection is it may be reflecting the fraction of 
time that they were receiving payment from one 
source or another, but I don't remember exactly. 

Now, we're going to take the information from our 
conditional probabilities and our expense regression 
and we are then going to calculate the expected 
expense in three different scenarios, nonsmoker 
without disease, nonsmoker with disease, and smoker 
with disease, correct? We need that for our final 
equation? 

We need three terms, expenditures for smokers with 
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24 


disease, for nonsmokers with disease, and for 


12:32:36 


25 smokers with — I'm sorry, for nonsmokers without 12:32:42 
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disease. 

And jumping ahead a moment, we're going to — 
everything we've done and talked about so far is 
being done in NMES, just to get us grounded, 
correct? 

The estimation of these models is done with NMES 
data, correct. 

Now, ultimately we're going to run people from BRFSS 
through these models, is that a term that makes 
sense to use it that way? We are going to apply 
these models to BRFSS persons. How do you say it? 
Behavioral Risk Factor Survey. 

Yeah, but am I saying that right, we are going to 
apply these models to the BRFSS people, or we're 
going to run the BRFSS people through these models? 

I wouldn't say it that way. 

How would you say it? 

I would say that we're going to apply these models 
to the claims data, and that where information is 
lacking on the claims data about certain variables 
in this model, we will assume values for those 
variables like we see for people in Minnesota, as 
assessed in the Behavioral Risk Factor Survey. 

For example, information on overweight or 
seat belt use. 
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1 


THE WITNESS: I don't mean to interrupt 

12:34:38 

2 


the proceedings, I would very much appreciate a 

12:34:40 

3 


chance to use the bathroom now. 

12:34:56 

4 


THE VIDEOGRAPHER: Temporarily going off 

12:35:00 

5 


the record. The time is 12:35 p.m. 

12:35: 04 

6 


(A lunch break was taken.) 

12:35:06 

7 


THE VIDEOGRAPHER: We're back on the video 

13:56:32 

8 


record. This is the third tape of the videotape 

13:56:34 

9 


deposition of Scott Zeger. The time is now 1:56 

13:56:38 

10 


p.m. 

13:56:38 

11 

BY 

MR. SILFEN: 


12 

Q. 

Well, we're back from lunch. Hamlin and Zeger had a 

13:56: 42 

13 


big lunch out. I sat here and thought every 

13:56:46 

14 


minute. 

13:56:48 

15 


Here was my thought, in the refined 

13:56:54 

16 


disease model for lung cancer/COPD, the only factors 

13:57:02 

17 


you controlled for are age, gender, disease? 

13:57:08 

18 


Anything else, that's it? 

13:57:10 

19 

A. 

That's it. 

13:57:10 

20 

Q. 

For CHD/stroke, you controlled for a number of other 

13:57:18 

21 


factors? 

13:57:20 

22 

A. 

Correct. 

13:57:20 

23 

Q. 

And forgive me if I asked you this before, but I'm 

13:57:26 

24 


not sure I did. Why that difference? Why you 

13:57:30 

25 


controlled for a number of factors for CHD/stroke 

13:57:34 
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2 A. 
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13 Q. 

14 

15 

16 

17 

18 A. 
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but not for lung cancer? 13 

This was recommended by Dr. Samet, and it was, as I 13 
recall his rationale, it was because of the fact 13 

that COPD and lung cancer have such a very high 13 

attributable fraction, that is to say that smoking 13 

is the very much dominant reason for the biological 13 
process that leads to those diseases. 13 

Whereas for the others, it wasn't to the 13 

same extent. And that other factors, such as 13 

socioeconomic status and the like might play a 13 

relatively more substantial role than they do for 13 

lung cancer in COPD. 13 

But what would have been the cost of using the same 13 
methodology and controlling for other factors? I 13 

guess why not do it, especially since so many 13 

factors were dropped out if they didn't need to test 13 
their significance, anyway? 13 

It was a judgment call. Typically when there is a 13 

strong biologic rationale for making a decision on a 13 
model, I think that tends to be followed and there 13 

is a cost of having to estimate more things, time, 13 

you know, computer runs, variability added to the 13 

estimate when variables, which aren't necessary, are 13 
added to a model. So I think those were the 13 

considerations that led us to this decision. 13 
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1 Q. Isn't it an unconservative way to do it? 13 

2 A. I don't know if it's unconservative or 13 


http://legacy.library.ucsf.@3iJUijft^i^c|(i6§a0QfipBif.industrydocuments.ucsf.edu/docs/rqfl0001 


57:34 
57:38 
57:46 
57:50 
57:54 
57:58 
58 : 00 
58:02 
58 : 04 
58 : 08 
58 :12 
58 :14 
58:22 
58:28 
58:32 
58:36 
58:38 
58:44 
58:48 
58:52 
58:58 
59:02 
59:06 
59:08 
59:10 


59:12 

59:16 



3 

4 Q. 
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9 A. 

10 Q. 
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25 


anticonservative or some other conservative. 

Here actually was my thought over lunch. Does that 
rationale hold up when you have made another choice, 
which is to include in the dollars the heart 
disease, diminished health status, and every other 
expense incurred by those people? 

Yes. 

Why? Let's assume for the moment that it's not 
trivial, that 30 or 40 percent of the dollars that 
we are putting in the lung cancer pot are not lung 
cancer dollars. Wouldn't we want to take other 
factors into account? 

Well, I've already answered that there's a couple 
things. First is that the dollars which are not 
lung cancer dollars might be as large as they are 
because of the presence of lung cancer. 

There is still the problem of having to do 
the estimation when — if you go ahead and try to 
fit the model for the lung cancer/COPD as you have 
for the others. 

And there is — I would grant the 
possibility that if what we have in the lung 
cancer/COPD is a mix of — well, is an addition to 
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1 

2 

3 

4 


lung cancer/COPD some other costs, and if those 
costs are strongly associated with these controlling 
variables, then you might actually get some ability 
to do some control from those additional costs. 
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5 


That, I would agree with. 


14:01:20 


6 Although, I would add that the difference 14:01:24 

7 between the core estimates, which controlled only 14:01:28 

8 for those I think 16 strata, and the refined model 14:01:34 

9 was not very large, the refined model having 14:01:38 
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21 


22 
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23 

Q 

24 
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25 

Q 


controlled for all those other factors. 

So I would speculate at this moment that 
if we had done that, it would be unlikely to have 
made much difference since doing all the control for 
the other sorts of things we looked at didn't seem 
to make very much difference. 

Of course, the principal point we're making here, 
which is the lung cancer wasn't just lung cancer 
dollars, applies to the core model, as well? 

I've already addressed, though, that issue. 

Well, that was my lunchtime thought. What was your 
lunchtime thought? 

I thought a little bit — well — 

Couldn't trick you. 

I thought about tuna fish. 

Okay. I tell you what, I looked up Dr. Miller's 
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1 answers to the questions we were talking about 

2 here. Let me give you copies of his deposition, 

3 which I know you've already seen. I lugged them all 

4 the way from Washington. Page 209. Actually page 

5 208, 209. It's the first page of the second day. 

6 A. I don't have page 208 or 209. 

7 Q. You probably do, actually. 



8 

A. 

Do I? 


14:03:06 

9 

Q. 

Oh, it's one fat copy. 


14:03:18 

10 


MR. HAMLIN: This is Volume 

I. This is 

14:03:28 

11 


Volume II. 


14:03:28 

12 

BY 

MR. SILFEN: 



13 

Q. 

Bottom of 208, it's me saying, "In the 

refined 

14:03:34 

14 


disease model," do you have that? 


14:03:36 

15 

A. 

Line 25, "In the refined disease model 

in the"? 

14:03:38 

16 

Q. 

"Expense regression," and then I say. 

"are we in the 

14:03: 42 

17 


same place?" That's a routine you're 

familiar with. 

14 : 03 : 44 

18 


"Mm-hmm (yes)." 


14:03 : 44 

19 

A. 

Right. 



20 

Q. 

"I asked whether the disease variable 

was disease 

14:03:48 

21 


specific or was yes, no, any smoking-related 

14:03:48 

22 


disease, and the answer is? Any." 


14:03:50 

23 


Do you agree with that? 


14:03:54 

24 

A. 

Yes, I do. 


14:03: 54 

25 

Q. 

And also I had actually earlier asked 

him why it was 

14:04: 04 
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1 "any" rather than "disease specific," but his first 

2 instinct, like yours, was that it was "disease 

3 specific." And he answered that at page 137. 

4 A. Okay. 

5 Q. And you see me saying, "I may well have asked 

6 already, which is why am I seeing an any disease 

7 coefficient rather than a COPD/lung cancer 

8 coefficient and a CHD/stroke coefficient." 

9 And if you'll read through there, you'll 
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10 


see that Dr. Miller's recollection was that you 

14:04:50 

11 


would try it and that it didn't make any 

14:04: 54 

12 


difference. 

14:04:56 

13 

A. 

I don't have any other — 

14 : 04 : 56 

14 

Q. 

Explanation? 

14:04:58 

15 

A. 

To add to that. I don't know. 

14:05: 04 

16 

Q. 

Well, let's take a look at his last line there. I 

14:05:08 

17 


said to him — 

14:05:10 

18 

A. 

Give me the page again. 

14:05:10 

19 

Q. 

137 . 

14:05:12 

20 

A. 

Okay. 

14:05:12 

21 

Q. 

Line 17. I said, "In other words, given that you 

14:05:16 

22 


had a one, any one of the major smoking-related 

14:05:22 

23 


diseases, there will be no difference in the 

14:05:24 

24 


expected expenditure?" And his answer was, "Data 

14:05:26 

25 


was unable to distinguish between one classification 

14:05:28 
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1 and another." 

2 Does that jog your recollection? 

3 A. No, I have no other recollection about this. 

4 Q. Okay. Let's assume Dr. Miller is correct. What is 

5 the significance of your regression being unable to 

6 distinguish between the expense of one 

7 smoking-related disease and another? 

8 A. I know of no special significance. 

9 Q. Well, do you think that it's a fact that all 

10 diseases have the same cause? 

11 A. I would assume it's not a fact that all diseases 

12 have the same cost, but I don't know the extent to 
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which they differ one from the other. 14:0 
Well, I assume with you that all diseases do not 14:0 
have the same cost. And so I ask again: What does 14:0 
it mean this regression can't tell the difference 14:0 
between one disease and another? 14:0 
I don't think it means anything. 14:0 
Okay. To your knowledge, does given that you have a 14:0 
smoking-related disease or one of the major smoking 14:0 
attributable diseases, does it matter to this report 14:0 
whether you're a smoker or a nonsmoker? 14:0 
When you say to this report, what do you mean? 14:0 
Yeah, I mean, anywhere in it. Well, we can stick to 14:0 
this expense regression, if you want. But to the 14:0 
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1 


expense regression or the average expense, do you 

14 :0 

2 


know if it matters whether you are a smoker or a 

14 : 0 

3 


nonsmoker? 

14 :0 

4 

A. 

Well, the models for the major smoking-related 

14 :0 

5 


diseases estimate the expected expenditures for 

14 :0 

6 


smokers versus for nonsmokers or for — and so in 

14 : 0 

7 


that sense it does matter, yes. 

14 :0 

8 

Q. 

But the average expense, if you have a 

14 :0 

9 


smoking-related disease, does it — 

14 :0 

10 

A. 

Or if you don't have a smoking-related disease. 

14 :0 

11 

Q. 

Does it differ between a smoker and a nonsmoker? 

14 :0 

12 

A. 

I'm sorry, ask that question again. 

14 :0 

13 

Q. 

Does the average expense, expected expense, given 

14 :0 

14 


that you have one of the smoking-related diseases. 

14 :0 
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15 differ between a smoker and a nonsmoker, do you 

16 know? 

17 A. Are you asking about the fit of the model to the 

18 data, the coefficients in the model? 

19 Q. I'm asking whether in this model, okay, a smoker 

20 with a smoking-related disease has a different cost 

21 than a nonsmoker with a smoking-related disease? 

22 A. I believe that this model allows for that 

23 possibility. 

24 Q. Okay. All right. Well, then we'll find out where. 

25 Now, one thing, when we broke we were 
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5 A. 


6 Q. 
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9 A. 

10 Q. 

11 A. 

12 

13 Q. 

14 A. 

15 

16 
17 


talking about how the BRFSS people are ultimately 14:0 

going to be used when we get through this model. 14:0 

Am I correct that first we estimate the 14:0 

model wholly in NMES, correct? 14:0 

Correct. 14:0 

And then we create a SAF. And the SAF is a product 14:0 

of the parameters of the model estimated in NMES and 14:0 

the characteristics of the BRFSS population? 14:0 

Not entirely. 14:0 

What else is in there? 14:0 

The characteristics of the group of people for whom 14:0 
we have a pot of money. 14:0 

How do they affect the SAF? 14:0 

Well, we know characteristics. We basically started 14:0 
with the expenditure data for the State of Minnesota 14:0 
and Blue Cross/Blue Shield; we have every 14:0 

expenditure. 14:0 
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19 

20 
21 
22 

23 

24 

25 


And we break it up into little smaller 14:0 

pots for which we know the person's age, we know 14:0 

their gender, we know their marital status, often we 14:0 
know their education, we know things from the 14:0 

billing record. 14:0 

And so we have the whole pool of money for 14:0 
people like that with those variables. Those are 14:0 

variables that are in our model we fit from NMES. 14:0 
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Now, there are some additional variables, 
as well, in the model from NMES, that we fit from 
NMES, like seat belt use and like whether they're 
overweight. 

But surely you are not telling me that the 
information in the claims data affects the 
calculation on percent? 

I am telling you that. 

How is it used? The calculation of the SAF, not the 
smoking attributable expense that you ultimately 
calculate. 

Yes, I heard what you said. 

How does it — we looked together at the formula, at 
the ultimate formula. 

Yes. 

Can you show me in the formula where the information 
from the claims data enters the formula? And I'm 
not asking you where factors, demographics like 
those in the claims data are used in the formula. 
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20 


I'm asking you where the information in the claims 


14:10:54 


21 data is used in the formula? 14:10:56 

22 A. I don't understand your question. 14:10:58 

23 Q. Well, then show me where the information in the 14:11:00 

24 claims data is used in the formula. 14:11:02 

25 A. Well, the claims data includes information like the 14:11:04 
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person's gender or like their education I think in 
some cases. So where it's used is in equation 3.1 
from this document 2324. 

It has six different quantities, which are 
estimated from our bivariate probit model, and from 
the expenditure, any expenditure and size of 
expenditure. Those models are fit to the data. 

Right. 

And the predictors for those quantities include 
things like education, which is on the claims form. 
Right, but — 

And so we take that education value and we stick it 
in the equation, which allows us to calculate these 
six quantities with the quantities we need to make 
the SAF calculation. 

So, for instance, are you telling me that if the 
education — do individuals from the claims data get 
a SAF? 

No, I did not say that. 

Do individuals from BRFSS get a SAF? 

Well, individual records, perhaps — yeah, 
individual records from the Behavioral Risk Factor 
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24 

25 


Survey are matched to the category, the claims 14:12:42 

category, matched by age, by gender, and so forth. 14:12:48 

And the reason we do that is because we 14:12:48 
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then get some additional information about people 
for whom — people in that claims for whom we're 
trying to calculate a SAF. 

And that additional information is 
information like whether they're overweight and so 
forth. And what we do then is we calculate the SAF 
using these six quantities which we've talked 
about. 

Sure. 

Using the demographics from the claim, because 
that's the people whom we're going to apply the SAF, 
and the additional information we were able to get 
from BRFSS, which we otherwise wouldn't have because 
it's not on the claims data. 

Is it your position that the proper pronunciation is 
BARFUS (phonetic)? 

I apologize, it's the Behavioral Risk Factor 
Survey. I actually have difficulty with either 
BRFSS or BARFUS (phonetic). 

We have to get the important stuff out. 

I appreciate the subtle contribution you made by 
using BRFSS rather than BARFUS (phonetic). 

You're telling me something I didn't know, and 
that's why I'm giving you trouble on this. I 
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25 thought, my impression was that it was BRFSS 14:13:52 
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1 


demographics that were used to weight the parameters 

14:13:58 

2 


of the model. 

14:14:00 

3 


You got a SAF, then the SAF was applied to 

14:14 : 04 

4 


the claims pot, a pot that was made from the claims 

14:14:08 

5 


data, but you're saying that's not correct? 

14:14:12 

6 

A. 

I think we're actually saying the same thing. 

14:14:16 

7 


because that demographic information is both on the 

14:14:18 

8 


claims record and also on the Behavioral Risk Factor 

14:14:24 

9 


Survey information. 

14:14 :24 

10 

Q. 

If what you're saying — 

14:14 :26 

11 

A. 

They're matched that way. 

14 :14 :26 

12 

Q. 

Okay. If what you're saying is that the claims data 

14:14:30 

13 


is broken up into cells, I don't like your term 

14:14:34 

14 


little pots, into cells — 

14:14:36 

15 

A. 

What do you think about big pots? 

14:14:42 

16 

Q. 

— into cells with particular sets of 

14:14:46 

17 


characteristics, and then you take BRFSS people, or 

14:14:50 

18 


BRFSS records, if you will, with similar 

14:14: 52 

19 


characteristics, and perhaps some additional 

14:14:54 

20 


characteristics, and use them to weight the 

14:15: 00 

21 


parameters of model and then you get a SAF and then 

14:15: 06 

22 


you lay it back against that cell, then I understand 

14:15:12 

23 


you. 

14:15:14 

24 


Is that what you're saying? 

14:15:14 

25 

A. 

Let me say it in my words to be explicit. 

14:15:14 
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1 

Q. 

Sure. 

14:15:14 

2 

A. 

What we're doing is calculating a SAF to apply to 

14:15:18 

3 


claims data, expenditure data. 

14:15:22 

4 

Q. 

Right. 

14:15:22 

5 

A. 

And we have done a regression analysis so that that 

14:15:24 

6 


SAF can vary by factors, some of which are listed on 

14:15:32 

7 


the claims data, and so we're going to use the SAF 

14 :15:36 

8 


that corresponds to those variables, demographic 

14:15:38 

9 


variables, on the claims record, but some of which 

14:15:40 

10 


are not. 

14:15:40 

11 


And to get information about the ones that 

14:15: 42 

12 


are not, we take all the people in the Behavioral 

14:15:46 

13 


Risk Factor Survey who match this cell, this cell of 

14:15: 50 

14 


claims with respect to the demographics, but who 

14:15:54 

15 


also then have information about seat belt use and 

14 :15:56 

16 


overweight. 

14:15:56 

17 


And we calculate the average SAF to apply 

14:16:00 

18 


to this claims cell using the values — we get the 

14:16:06 

19 


average SAF by using the information about 

14:16:08 

20 


overweight, seat belt use, and so forth, that's not 

14:16:12 

21 


on the claim data, that is in the behavioral risk 

14:16:14 

22 


factor data to apply to the cell that we're talking 

14:16:18 

23 


about. 

14:16:20 

24 

A. 

And — 

14:16:24 

25 

Q. 

Is every person in BRFSS given an opportunity to 

14:16:32 
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1 


2 A. 

3 

4 

5 

6 

7 

8 
9 

10 Q. 

11 

12 A. 

13 

14 

15 

16 Q. 

17 A. 

18 Q. 

19 A. 

20 Q. 

21 
22 

23 

24 

25 


have a SAF? 

To the best of my knowledge, every person for whom 
we have a — every person for whom we have a record 
on the Behavioral Risk Factor Survey who corresponds 
to a claims cell gets to contribute their additional 
variables so that the SAF reflects the average, the 
SAF reflects the average of those additional 
variables over the behavioral risk factor people in 
that cell. 

What are the factors that you recall define the 
cells? 

Let's see, it's age, it's gender, it's obviously a 
pair type. I believe not on all but on some of the 
records it also includes marital status, educational 
level. 

I think race, too. 

I think race, as well. 

Dr. Miller said — 

I'm not sure of all of them. 

Dr. Miller described this in the context of the 
diminished health status model, but I didn't — so 
when we say match up, do we mean that you're going 
to take a BRFSS person and match him or her exactly 
for age, gender, race, and the other factors you've 
described with someone in a claims data cell? 
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1 A. Well, the cell is defined in 

2 variables I've listed to the 

3 recollection. 


terms of these 

14:18:38 

best of my 

14:18:42 


14:18:42 
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4 Q. 

5 A. 

6 

7 

8 Q. 

9 

10 A. 

11 Q. 

12 

13 A. 

14 

15 

16 Q. 

17 

18 

19 A. 

20 
21 
22 

23 

24 

25 


Right. 

And we then find all of the people in the Behavioral 
Risk Factor Survey who would fall into that cell by 
virtue of their values for those variables. 

Does the cell have a tolerance band, or is it you 
are — 

We're not matching. 

— you are a married person, you are Hispanic, you 
are — well, is it age, one-year age specific? 

I'm not sure. You'd have to refer to Dr. Miller, I 
think. I don't know exactly the width of the 
intervals. 

But whatever those factors are, you're going to have 
to be exactly on point. You're going to have to 
match them exactly? 

However the claims cell is defined, it might be 
defined in terms of multiple years of age or in 
terms of multiple education categories. 

But however it's defined, as I understand 
the application, the BRFSS people are chosen so that 
they match those defining variables. And then we 
calculate the SAF, the values of those variables 


14:18:42 
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1 

2 

3 

4 

5 


which define the cell, and the SAF is averaged over 
the other variables that we didn't have information 
on the claims record. 

To be precise, when we say average, what 
we actually do, and the reason perhaps there's 
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some 


7 

8 
9 

10 

11 

12 

13 

14 Q. 

15 

16 

17 

18 A. 

19 

20 Q. 

21 

22 A. 

23 Q. 

24 A. 

25 


— where there was a little bit of 
miscommunication was we actually calculate the SAF 
for all the people who fall into that cell and then 
use the average of the SAFs rather than taking the 
average of their behaviors, behavioral variables, 
and then you plugging that in once to the SAF 
calculation as a way to deal with some nonlinearity 
that can arise if you do the latter. 

And you implied earlier, although I'm not sure you 
said you knew, that there would be some BRFSS people 
that would not be used because they didn't match 
up? 

I don't know that that would be the case. I didn't 
do the matching myself, and I'm not sure. 

Peter asked whether two people could match with one 
or was only one-to-one? 

Match with one what? 

One person in a cell with another. 

Well, I didn't — the matching is not with 
individuals in the cell, everybody in the cell. In 
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1 fact, what happens is we accumulate the dollars in 

2 the cell. It really is a pot. We accumulate the 

3 dollars, but the dollars have a label on them which 

4 is these demographic characteristics. 

5 Q. This may be a semantic point, but, nonetheless, 

6 let's assume that everybody from BRFSS is used, 

7 okay? In what sense does the Minnesota claims data 

8 affect the SAF? 
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9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 
21 
22 

23 

24 

25 


Haven't you done nothing more than create 
compartments that your SAFs can be laid back up 
against? 

MR. HAMLIN: Objection to form. 

THE WITNESS: Yeah, I don't understand the 

question. 

BY MR. SILFEN: 

Q. Here's my point. Let's assume for the moment, which 
I have a feeling is true, that every BRFSS person is 
used, you're not just discarding some. Okay? 

A. Okay. 

Q. And I think I would have heard about it from 

Dr. Miller or Dr. Wyant somewhere along the way if 
BRFSS people were being thrown in, thrown out, so 
assume every BRFSS person is being used. 

In what sense is the SAF, the results or 
the SAFs, the results, affected by the claims data? 
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1 A. Well, it's affected by the claims data in the sense 

2 that the values put into the equations fit in this 

3 model. 

4 Q. But — 

5 A. The values for those demographic variables which 

6 define the cells, those are the things that go in 

7 here to calculate the SAF. 

8 Q. I understand. Dr. Zeger, but suppose there were no 

9 claims data, wouldn't you have exactly the same 

10 SAF? 
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11 

A. 

No. 

14:23:06 

12 

Q. 

I mean, your NMES, your parameters of your NMES 

14:23:14 

13 


model would be the same without the claims data. 

14:23:18 

14 


right? 

14:23:18 

15 

A. 

The parameters of the NMES model are fit from the 

14:23:22 

16 


NMES data. 

14:23:24 

17 

Q. 

They would be the same regardless of what was in the 

14:23:26 

18 


claims data? 

14:23:28 

19 

A. 

Correct. 

14:23:28 

20 

Q. 

And the BRFSS people, assuming we used them all. 

14:23:32 

21 


they don't change, either, based on what's in the 

14:23:34 

22 


claims data, right? 

14:23:36 

23 

A. 

The BRFSS, the behavioral risk factor data are the 

14:23:42 

24 


behavioral risk factor data. 


25 

Q. 

And if we use them all, we are going to get a SAF 

14:23:46 


158 


1 


that is a product of two things, the 

demographics of 

14:23:50 

2 


the BRFSS people, and the parameters 

from the NMES 

14:23:52 

3 


model, right? 


14:23:54 

4 

A. 

Wrong. 


14:23:54 

5 


MR. SILFEN: I'm sorry. Do you see where 

14:24:00 

6 


I'm wrong? I'm just trying to shortcut this if I 

14:24: 02 

7 


can. 


14:24:02 

8 

BY 

MR. SILFEN: 



9 

Q. 

Peter is suggesting that maybe what 

I'm missing and 

14:24:28 

10 


what you're saying is that the BRFSS 

data gets 

14:24:30 

11 


somehow weighted by the creation of 

the cell, is 

14:24:32 

12 


that it? 


14:24:32 

13 

A. 

We calculate a SAF only for a cell. 

And the people 

14:24:38 
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14 that we — whose SAFs we average across are the 

15 people who match that cell. 

16 If there wasn't that cell, it wouldn't be 

17 used. So how the people are grouped together 

18 depends upon the claims data. 

19 Q. Where I went wrong was I kept on thinking that each 

20 BRFSS individual goes in. If that were true, then 

21 what I was saying would be right. But if they go in 

22 and they create only a cell SAF, I see the 

23 difference. 

24 So what's the — I think I see that the 

25 demographics of the claims data then do affect, do 
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1 

2 

3 

4 

5 

6 

7 

8 
9 

10 

11 

12 

13 

14 

15 


play into the formula. 

Okay. Well, that was very helpful. Most 
of the time we hope we're finding out things we 
know, but I didn't understand that. 

Now, where we were when you guys took your 
luxurious lunch was we were at the — we were still 
in the expense part of the model, and we had talked 
about the estimation of the regression. 

And now, as I understand it, the next 
thing that happens is we have to take our — some of 
the information from the regression and some of our 
demographics and some of our conditional 
probabilities and we have to figure out an average, 
an expected average expense, for three different 
scenarios; is this correct? 
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16 A. 


You're referring to equation 3.1? 


14:26:34 


17 Q. Well, no, actually, I'm referring to components of 

18 the equation. 

19 A. The expenditure components of the equation 3.1? 

20 Q. As I understand it, we have to calculate an average 

21 expected expense for a nonsmoker without a current 

22 treated disease and nonsmoker with a current 

23 treatment and a smoker with current treatment? 

24 A. I agree. 

25 Q. And I actually think that the last — this is the 


14:26:38 
14:26:40 
14:26:44 
14:26:46 
14:26:48 
14:26:52 
14:26:54 
14:26:56 
14:27:02 


last piece of this puzzle we're all happy to say. 14:27:06 

I understand that there are — the 14:27:10 

calculation is different for each of those three 14:27:18 

scenarios, but the components are these: First, the 14:27:24 
coefficient for the various covariates we're going 14:27:34 

to use, such as education. 14:27:36 

And, second, for the smokers with and the 14:27:50 
nonsmokers with, the coefficient for the disease 14:28:00 

covariate in the regression model, you with me so 14:28:06 


10 


far? 


14:28:06 

11 

A. 

I think so. 


14:28:06 

12 

Q. 

And those are easy. And then we also have 

for the 

14:28:26 

13 


nonsmoker without, we have the coefficient 

for 

14:28:34 

14 


pubsmk if not significant multiplied times 

the 

14:28:44 

15 


percentage of smokers in BRFSS. You're looking — 

14:28:50 

16 

A. 

You lost me at the end there. 


14:28:52 

17 

Q. 

Okay. I have four pieces of the equation. 

The 

14:28:58 

18 


first two were easy; the coefficient for the 

14:29:02 
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19 

20 
21 
22 

23 

24 

25 


demographics and the coefficient for the disease 
factor, okay. 

There are two others. They are both the 
pubsmk or privsmk factor. For nonsmokers without, 
you get this term, coefficient for pubsmk if not 
significant and it is multiplied times the 
percentage of smokers from BRFSS. 


14:29:06 

14:29:08 

14:29:14 

14:29:26 

14:29:40 

14:29:42 

14:29:44 


1 

2 

3 

4 

5 

6 

7 

8 
9 

10 A. 

11 Q. 

12 

13 A. 

14 

15 Q. 

16 

17 

18 

19 

20 


161 

And for smokers with — the record should 
reflect the witness is already shaking his head, but 
why don't I finish and then you'll have it all and 
we'll talk about it. 

For smokers with, we get also, we get the 
coefficient for pubsmk or privsmk if significant 
multiplied times the probability of smoke, given 
smoking-related diseases. 

Just all blank here? 

I can't tell you whether that's correct or not. 

And I take it you also don't have any sense what it 
means? 

It was a long stream of words. It's hard for me to 
follow. 

I understand. Why don't I back up and make it 
simple. Let's assume that in the equation for the 
calculation of the average expense we have our 
demographics, our coefficient for demographics and 
our coefficient for disease, yes, no, and now we 
have a pubsmk or privsmk term? 
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21 A. Agreed. Let me see if I — what I can agree. The 

22 regression coefficient, the regression model, both 

23 for the probability of any expenditure and for the 

24 size of the expenditure, given any, that involves 

25 the covariates, smoking, whether it be pub or 
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162 


1 


private, and disease, currently-treated disease. 

14:31:24 

2 

Q. 

So we have those three parts. Dr. Miller also did 

14:31:30 

3 


not at first asking could not explain this. And 

14:31:34 

4 


then the next day he did explain it. And I'm not 

14 : 31:38 

5 


trying to do a test here. It was extremely puzzling 

14:31: 42 

6 


to us. And I would be interested in your 

14 : 31: 44 

7 


explanation of this construction. 

14:31: 50 

8 


I should — you might want to look at his 

14:31:52 

9 


deposition because he actually on the second day 

14:31:54 

10 


came in and said that I was, in fact, reading this 

14:32: 00 

11 


from the code correctly, but that it was a mistake. 

14:32:04 

12 


that you guys had — that it wasn't what you should 

14:32:06 

13 


have done. 

14:32:08 

14 


I'm not saying he was saying it was an 

14:32:10 

15 


awful huge thing. 

14:32:12 

16 


MR. SILFEN: Do you want me to show him? 

14:32:14 

17 


MR. HAMLIN: Sure. Well, it's your 

14:32:20 

18 


deposition, Tom. I thought you were going to refer 

14:32:22 

19 


him to a page, that's all. 

14:32:24 

20 


MR. SILFEN: Maybe I should do that. 

14:32:26 

21 

BY 

MR. SILFEN: 


22 

Q. 

If you look at page 210, and I don't know whether 

14:32:36 

23 


I'll have you read it today or just you can take a 

14:32:40 
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24 look at it and we can orient on what it is. 14:32:54 

25 If you look at 210 right in the middle 14:32:56 
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13 

14 

15 A. 

16 

17 

18 

19 

20 Q. 

21 
22 

23 

24 

25 


you'll see me saying, "Okay. I asked in particular 
why it appeared that pubsmk appeared twice and was 
specified so that once it was in if not significant 
and another place it was in if significant. Did you 
see that?" 

He goes on and explains, and if you follow 
that explanation down to the bottom of 212, 

Dr. Miller ends up saying that he's discovered an 
error in the code last night when he looked at this 
matter. 

I think it would not be productive to have 
Dr. Zeger read this now. But since you have no 
recollection of this matter, if you could just take 
a look at that page or two tonight. 

I don't want it to appear that I have no 
recollection of the matter. I was — it's hard to 
read an equation and, you know, be able to comment 
whether it's correct or not, so I'd be happy to read 
this . 

I think the record will reflect that you affirmed 
there would be three pieces to this, which would be 
the demographic covariates and the disease and the 
smoking covariate, and that it was the details of 
the smoking covariate that we had a problem. 

Now, having said that, having said that, I 
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25 



do — I should also say that neither you nor 
Dr. Miller were able to explain at first look what 
seems to me to be a fundamental aspect of the 
equation. So I will be looking forward to your 
explanation tomorrow. 

Well, I'm here to answer your questions. My comment 
to you was that I didn't understand the question. I 
feel capable to answer questions about this model 
and I would be happy to do so. 

Well, then let me ask you again. Why is it that the 
pubsmk or privsmk appear twice in the equation for a 
smoker with once if specified if not significant 
multiplied times percentage of smokers from BRFSS 
and once specified if significant multiplied by the 
probability of smoke given disease? 

Are you referring to the smoking variable in the two 
different regressions, the any expenditure and in 
the size of the expenditure when you say 2? 

Yeah, it would be probability — 

The reason that there are two coefficients is 
because another way to model this would be to have 
one model for the log expenditures, for example, and 
not have a separate model for any versus the size. 

But often in data like this where a large 
fraction of people have no expenditure — 
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1 Q. This isn't the regression. I don't mean to 

2 interrupt you. This is not the regression. This is 

3 the computation of the average expense. We've 

4 already discussed the regression and what was in 

5 it. 


6 

7 

8 
9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 
21 
22 

23 

24 

25 


Now we're talking about the calculation, 
the computation — Peter also corrected me, it's not 
in there twice, once for probability and once for 
level. In the probability of expense calculation, 
computation — 

A. Probability of expense? 

Q. Yes. For a — 

A. Probability of any expense you're saying? 

Q. Yes. For a smoker with COPD, the factors, the 

pieces of that equation are the coefficient for your 
covariates, the coefficient for disease, and the 
coefficient for pubsmk if not significant multiplied 
times the percentage of smokers from BRFSS and also 
the coefficient for pubsmk if significant multiplied 
times the probability of smoke given disease? 

MR. HAMLIN: Objection to form. 

BY MR. SILFEN: 

Q. I thought you had said you didn't understand this 
and we were — 

A. I didn't understand the question is what I said. 
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1 Q. What is this? 
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2 A. 

3 Q. 

4 

5 

6 
7 


9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 
21 
22 

23 

24 

25 


A. 

Q. 

A. 

Q. 

A. 

Q. 


I don't know. 

Well, I've referred you to the pages where 

Dr. Miller looked at it, and it's only a page or 

two, so why don't you take a look tonight. 

The other thing you might look at is I 
asked you whether a smoker and a nonsmoker would 
have a different disease, a different expense, given 
that they had a smoking-related disease. And you 
said that it was allowed for in the computation. 

Do you remember that exchange? 

I'm sorry, repeat that. A smoker and a nonsmoker 
both of whom had the disease? 

Would have a different expense for that disease in 
this — 

You asked, and I said I thought that was allowed 
for? 

Yes. And what I would like you to do, if you're 
going to look at this, is tell me where that's 
allowed for. 

Okay. 

Given that we haven't got it straight, it's silly to 
ask that next question. 

Let me ask you another question along the 
same line. Is there any place in the model, the 
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1 refined disease model, where the prevalence of 

2 smoking in BRFSS makes a difference? 

3 A. This is the refined — 

4 Q. Anywhere, yes, in the refined, as opposed to the 
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5 


6 A. 

7 

8 Q. 

9 

10 

11 

12 

13 

14 

15 

16 

17 A. 

18 

19 

20 
21 
22 

23 

24 Q. 

25 


prevalence in NMES. 14 

— refined disease model. To the best of my 14 

knowledge, no. 14 

Just to round out how the BRFSS claims data people 14 

are used, as I understand it, each BRFSS person or 14 

record matched, now I understand, with a claims data 14 
record, goes into the conditional probability 14 

regression and his or her characteristics go through 14 
those conditional probabilities, and the result is 14 

for that person a probability of disease and a 14 

probability of smoke. 14 

Is that correct? 14 

Well, if we take the demographic variables that 14 

define the claims cell, and then take a person from 14 
the Behavioral Risk Factor Survey that falls into 14 

that cell, then by taking their other variables, 14 

like whether they're overweight and so forth, we can 14 
calculate the components of the SAF, all the 14 

components in equation 3.1. 14 

Right. I understand that, and that is helpful. But 14 
I was trying to get those components piece by piece, 14 


168 


1 

2 

3 

4 

5 

6 A. 


and it seems to me there are two main components. 14 

One is the, as there always have been, the 14 

conditional probabilities and then the expense 14 

piece. And they go into the conditional probability 14 

regression — 14 

Right. 14 
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7 

Q. 

— with their demographics. 

14:42:06 

8 


And the result is that for that person you 

14:42:10 

9 


then have all the conditional probabilities based on 

14:42:14 

10 


their demographics? 

14:42:16 

11 

A. 

Correct. 

14:42:18 

12 

Q. 

And along the way, what you have calculated for them 

14:42:20 

13 


is a probability of disease and a probability of 

14:42:24 

14 


smoke jointly? 

14:42:28 

15 

A. 

You can calculate for — I mean, for the set of 

14:42: 34 

16 


demographics variables and the set of variables that 

14:42:38 

17 


come from a BRFSS record, you can calculate the 

14:42:42 

18 


joint probability distribution of being currently 

14:42 : 44 

19 


treated and smoking and, therefore, can calculate 

14:42:48 

20 


the conditional probabilities used in the equation 

14:42:54 

21 


3.1. 

14:42:56 

22 

Q. 

And in a sense, each person — I'll say each person 

14:42:58 

23 


and I will mean by that what I think you mean 

14:43:02 

24 


without trying to restate it, which is a combination 

14:43: 04 

25 


of the claims data and the BRFSS information. 

14:43:08 


169 


1 Okay? That won't work? 

2 A. I think I made it clear what I meant, yeah, and I 

3 think we're on the same wavelength. 

4 Q. You don't — 

5 A. I think we are. 

6 Q. Now, we also put each person or record into the 

7 expense equations and we put them in once as a 

8 never-smoker without, once as a never-smoker with, 

9 and once as a smoker with; is that correct? 
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10 A. 

11 Q. 

12 

13 

14 A. 

15 

16 

17 

18 

19 

20 
21 
22 

23 

24 

25 


I believe so, yes. 

Now, that means that we are, in a sense, going to 
get a SAF for persons who are in reality 
nonsmokers? 

Well, I've tried in my answers to the last couple of 
questions to make the following point: We're not 
interested in a SAF for a person from the Behavioral 
Risk Factor Survey, no interest to us. 

What we're interested in is a SAF for a 
cell of expenditures that's defined in terms of 
demographic parameters. 

But in an effort to do the best possible 
job controlling for other factors that were thought 
by Dr. Samet to be of interest, which were available 
in the Behavioral Risk Factor Survey, in order to do 
the best possible job, we found people from the 
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1 

2 

3 

4 

5 

6 

7 

8 
9 

10 

11 


Behavioral Risk Factor Survey who matched this 
expenditure cell. 

And we calculated one SAF for that cell by 
getting the components of the SAF for all the people 
who matched and getting an average SAF from them. 
We're not interested in a SAF for this person from 
BRFSS who smokes — we don't care. 

We are just trying to use the BRFSS data, 
the Behavioral Risk Factor Survey data, to complete 
the story with respect to variables like overweight 
and seat belt use that we had available to us in 
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12 


NMES and, therefore, put into our regressions for 

14:46:02 

13 


the possibility that we could control from them. 


14 

Q. 

Nonetheless, the claims data doesn't have smoking 

14:46:06 

15 


information, right? 

14:46:06 

16 

A. 

Correct. 

14:46:06 

17 

Q. 

And as you've said, you don't use the BRFSS smoking 

14:46:18 

18 


information, either? 

14:46:18 

19 

A. 

Correct, in this disease model. 

14:46:22 

20 

Q. 

Right. And, therefore, even if we take the view 

14:46:32 

21 


that we're getting a SAF for the cell, you have 

14:46:38 

22 


people contributing to that SAF who were nonsmokers 

14:46:42 

23 


treated as smokers in the model; isn't that right? 

14:46:48 

24 

A. 

We have information from the Behavioral Risk Factor 

14:47:00 

25 


Survey people who match based upon demographics our 

14:47:04 
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1 


cell, some of them may be smokers, some not, I don't 

14:47:08 

2 


know. You know, I'm assuming that that's the case. 

14:47:12 

3 


But it's the average SAF which we 

14:47:16 

4 


calculate for all those people using this equation. 

14:47:18 

5 


as I've described, which we then applied to the 

14:47:22 

6 


cell. 

14:47:22 

7 

Q. 

To calculate an average, you had to have as an 

14:47:26 

8 


intermediate step a value for each person that 

14:47:30 

9 


becomes part of the average, right? 

14:47: 32 

10 

A. 

Yes. 

14:47:32 

11 

Q. 

And some of those people in the intermediate step 

14:47:36 

12 


are persons who did not smoke who, nonetheless, have 

14:47:40 

13 


a SAF? 

14:47:40 

14 

A. 

I don't disagree with your description of the 

14:47 : 44 
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15 

16 

17 

18 

19 

20 

21 Q. 

22 

23 

24 

25 


calculation. What I'm trying to make clear is that 
we're not in the business of calculating a smoking 
attributable fraction for a person on the Behavioral 
Risk Factor Survey. So when you say they have a 
SAF, I don't know what to make with that. I'm 
trying to disagree with that. 

I will stay with your agreement with the 
calculation. 

What is the interpretation of a SAF for a 
person who never smoked, in your intermediate step? 

MR. HAMLIN: Objection to form. 
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1 


THE WITNESS: Are you asking me about the 

14:48:22 

2 


calculation of a — 

14:48:24 

3 

BY 

MR. SILFEN: 


4 

Q. 

You've told me that in your intermediate step each 

14:48:28 

5 


of the persons, including people who do not smoke. 

14:48:32 

6 


would have a SAF. 

14:48:34 

7 


MR. HAMLIN: Objection; mischaracterizes 

14:48:38 

8 


his testimony. 

14:48:38 

9 


THE WITNESS: I did not say that. My 

14:48:38 

10 


answer was to the contrary. 

14:48:40 

11 

BY 

MR. SILFEN: 


12 

Q. 

I thought you said you agreed with the calculation? 

14:48:44 

13 

A. 

You misrepresented what I said. 

14:48:46 

14 

Q. 

Oh, my heavens, I certainly didn't mean to. 

14:48:50 

15 

A. 

Good. 


16 

Q. 

Now, let's replay this. I said that if you've 

14:48:58 
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17 

18 

19 

20 

21 A. 

22 

23 

24 

25 


gotten an average, then as an intermediate step you 
must have had a SAF for each of the persons. And I 
thought you said, "I agree with that as a 
calculation." Am I wrong? 

What I said is that we are calculating a SAF for a 
claims cell, that we'd calculate that SAF by taking 
the characteristics of individual persons in the 
Behavioral Risk Factor Survey. 

Those characteristics, most of which match 
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14:49:32 

14:49:36 

14:49:38 
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11 Q. 

12 

13 

14 A. 

15 

16 Q. 

17 

18 
19 
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the demographics that define the cell plus those 
additional ones, and we calculate a SAF with those 
values for those variables. 

Okay, that SAF doesn't necessarily belong 
to that person. It's a SAF that uses the couple 
variables from that person. 

And then we average those values over all 
the people and to obtain a SAF which we then apply 
to the cell. That's what I said. And if you want 
to consider this — well, that's what I said. 

For expense, to get an expense difference between a 
smoker and a nonsmoker, you have to compare a smoker 
and a nonsmoker, correct? 

We calculate differences in expenditures for 
groups. We take the — 

All right, a group of smokers and a group of 
nonsmokers. You have to have a group of nonsmokers 
to compare to a group of smokers to get a 
difference, right? 
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20 

A. 

Correct. 


14:51:48 

21 

Q. 

In the expense model, the 

expense part of the 

14:51:52 

22 


refined disease model, what group of smokers do you 

14:51:54 

23 


compare to nonsmokers? 


14:51:56 

24 

A. 

The data from NMES, using 

the data from NMES. 

14:52:00 

25 

Q. 

In the expense model? 


14:52:04 


174 


1 

A. 

Yes . 

14:52: 04 

2 

Q. 

The data from NMES? 

14:52:10 

3 

A. 

Yes . 

14:52:12 

4 

Q. 

What group of smokers and nonsmokers from BRFSS do 

14:52:34 

5 


you use to weight the expense calculation in the 

14:52:42 

6 


model or from the claims data? 

14:52:46 

7 

A. 

I'm sorry, could you read that back? I didn't quite 

14:52:52 

8 


get it. 

14:52:54 

9 


(The requested portion read back.) 

14:53:06 

10 


THE WITNESS: That's not the use we make 

14:53:14 

11 


of the behavior risk factor data in the disease 

14:53:20 

12 


model. 

14:53:22 

13 

BY 

MR. SILFEN: 


14 

Q. 

Now, the BRFSS people had their own smoking 

14:53:36 

15 


information, why didn't you use that? 

14:53:40 

16 

A. 

The information that was necessary to calculate a 

14:53:56 

17 


SAF using equation 3.1 in the document you gave me 

14:54: 02 

18 


requires us to know something about the joint 

14:54: 04 

19 


distribution of currently treated disease, smoking. 

14:54:10 

20 


and expenditures. 

14:54:10 

21 


We were able to estimate that joint 

14:54:12 
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22 distribution in NMES. We could not estimate that 

23 joint distribution in the Behavioral Risk Factor 

24 Survey. 

25 Q. That assumes that you're going to use exactly the 
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14 

15 

16 Q. 

17 

18 A. 

19 

20 
21 

22 Q. 

23 

24 
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model you already specified. My question doesn't 
assume that. 

My question is: Why did you specify a 
model that required you to make up smoking data for 
the BRFSS people? 

We did not make up smoking data for the BRFSS, for 
the Behavioral Risk Factor Survey data. We used the 
model that was the best statistical practice in 
order to estimate the quantity we wanted to know, 
the smoking attributable expenditures. 

And in order to do that, you needed to 
know something about the joint distribution of 
treatment, smoking and expenditure, which was 
available in NMES but not in the behavioral risk 
factor. 

Tell me again why this is the best statistical 
model, this one you have here? 

Because it allows us to calculate the smoking 
attributable fraction, which is what we need to 
apply to the expenditures in order to obtain the 
smoking attributable expenditures. 

We had a discussion this morning which we, I think, 
established NMES has the expenditure data you need 
and the covariates that you need. 
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25 


And I'm not sure you were able to explain 14:55:28 
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24 

25 


to me then why you couldn't just estimate a model 
which had expenditure as its outcome and smoking and 
other covariates as the predictors, you wouldn't 
have had to make up this data. Why not? 

A. The answer I gave you this morning was that I 

couldn't respond to your hypothetical model that you 
were sort of describing with lack of sufficient 
detail for me to on the spot give a critique of it. 

I tried to answer your questions about 
what we did and why our model enables us to address 
the question we were asked to address. I did not 
say that — I did not comment upon any failing of 
the model that we used or any other sort of 
suggestion. 

Q. I didn't suggest that you did. On the contrary 

you — I asked you a question why you didn't do it a 
different way, and you answered in effect because we 
did it this way and it's the best way. I think 
that's how we got back where we were. 

MR. HAMLIN: Wait, object on the grounds 
that mischaracterizes Dr. Zeger's testimony. 

MR. SILFEN: We'll read it. Perhaps it 
did. And if it did, I apologize. 

BY MR. SILFEN: 

Q. Let's just confirm your understanding of how 
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1 


covariates were used or not used in the model. 

14:56:58 

2 


What was the basis on which your group of 

14:57:08 

3 


covariates was originally chosen? Why did you use 

14:57:12 

4 


— why did you have that base group? 

14:57:14 

5 

A. 

These were the covariates which Dr. Samet indicated. 

14:57:18 

6 


of the ones that were available to us from NMES, 

14:57:20 

7 


would be most important to attempt to control for. 

14:57:26 

8 

Q. 

And you had no input on that at all? 

14:57:32 

9 

A. 

It was a discussion. He was the medical expert. We 

14:57:36 

10 


tried to derive the choice of variables based upon 

14:57:38 

11 


medical understanding to the extent possible. 

14:57:40 

12 

Q. 

Is there any distinction between the variables that 

14:57:46 

13 


Dr. Samet chose and ones that Leonard Miller had 

14:57:52 

14 


been using for five years? 

14:57:52 

15 

A. 

Is there a distinction? 

14:57:56 

16 

Q. 

Do you know? 

14:57:56 

17 

A. 

I don't know. 

14:57:56 

18 

Q. 

Now, the variables were kept or not kept depending 

14:58:12 

19 


on some tests of significance; is that correct? 

14:58:16 

20 

A. 

My understanding is that variables were retained in 

14:58:22 

21 


the model in groups. 

14:58:28 

22 


If one of a member of the group had a P 

14:58:30 

23 


value that was less than approximately .15, 

14:58:34 

24 


something like that I think is what Dr. Miller has 

14:58:36 

25 


indicated to me. 

14:58:36 


178 


http://legacyJibrary.ucsf.@3ijuijft^i^c|(i6§a0<2fei(sreif.industrydocuments.ucsf.edu/docs/rqfl0001 



1 Q. 

2 

3 

4 A. 

5 

6 Q. 

7 

8 
9 

10 A. 

11 Q. 

12 

13 

14 

15 A. 

16 

17 

18 

19 

20 
21 
22 

23 

24 

25 Q. 


Okay. So you — the way you're describing that 
sounds to me like you were not a part of the 
decision-making on this issue? 

I did not make the decision to include or drop out 
variables. 

What is your thinking about the practice of keeping 
variables that are not significant based on some 
kind of a nonstandard test, if that's a way to say 
it? Is it right, wrong, doesn't matter? 

Can you be more specific? 

Well, the variables were not kept because they met 
the standard 95 percent significance, they were kept 
on some other basis which you say is roughly .15. 
Why? What's the reasoning for that? 

It's actually not the standard that you would retain 
variables in a regression model if their P value was 
less than .05, that would not be a typically 
standard practice. 

Typically a looser criterion is used to 
admit variables to a regression model. Although, 
there's no hard and fast rule about this. Some 
people prefer to be a bit more liberal in it, and 
others a bit more conservative in terms of admitting 
variables. 

Some people would keep them all? 
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1 A. Some might keep it all. There's a trade-off of 15:00:08 

2 precision and an error in the estimation, and there 15:00:12 
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4 Q. 

5 


7 


8 A. 


9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 
21 
22 

23 

24 

25 


A. 


Q. 


A. 

Q. 

A. 


isn't a hard fast rule one way or the other. 

Would it also be appropriate to make the decision 
whether to keep a variable or not based on whether 
it has an outcome on the — has an effect on the 
outcome on the SAF? 

I'm not sure I understand the question. 

Well, it would be possible, for instance, to — for 
an insignificant variable to have an effect on the 
outcome that you're predicting, right? 

On the total smoking attributable expenditures, for 
example? 

Right. So another way to decide whether to keep or 
not keep a variable would be based on whether it 
affects the outcome, for instance, the smoking 
attributable expenditure, or, for instance, the 
smoking coefficient? 

That might be another way. 

And that was not done? 

No. What was done is what I — I mean, I'm only 
reporting what Dr. Miller has told me. I think his 
testimony would be most appropriate for that. 

That's fair. Would you agree that the most 
important results in the expenditure models are the 
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1 estimates of the coefficients multiplying the 

2 categorical measures of smoking history? 

3 A. I'm sorry, say that again. 

4 Q. I was just reading this sentence, footnote 30 

5 A. Certainly the coefficients for smoking in the 
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6 

7 Q. 

8 

9 A. 
10 
11 
12 

13 

14 

15 

16 

17 Q. 

18 

19 

20 

21 A. 

22 

23 

24 

25 Q. 


expenditure models are important variables. 

Well, this footnote says the most important 
results. Do you agree with that or don't you? 

I'm not exactly sure that's what's being contrasted 
with, I mean, results sort of includes a lot of 
things, and I don't know exactly how to. 

I, unfortunately, didn't write that one 
sentence so — I would certainly agree that the 
smoking coefficients are very important. And 
whether it's the most along what transit of scale 
I'm not sure, but it's very important, yes. 

Okay. Well, that leads back to the question we were 
on. Why wouldn't the better test for keeping or not 
keeping a variable being whether it affected the 
smoking coefficient? 

Well, that might be one criterion to consider. But 
another criterion is the strength of evidence that 
that variable is an important predictor, which is 
the one that was used here. 

But as far as you know, no test was done to 
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1 determine whether omitted variables would or would 

2 not affect the smoking coefficient? 

3 A. So far as I know. Although — but I didn't do 

4 that. 

5 Q. That's fine. 

6 A. And when you say — let me just be specific, we're 

7 talking about variables that we had the data 


15:04:54 
15:04:56 
15:05:02 
15:05:02 
15:05:04 
15:05:10 
15:05:12 


http://legacy.library.ucsf.@3iJUijft^i^c|(i6§a0QfipBif.industrydocuments.ucsf.edu/docs/rqfl0001 
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available that were in the model, and we're talking 
about whether to drop certain variables that didn't 

10 achieve a certain level of predictive ability. 

11 In that regard, I don't know of anything 

12 that was done looking at their relationship to the 

13 smoking variable. 

14 Q. Did you ever examine the other variables that were 

15 available in NMES that might have been used? 

16 A. I've looked at the NMES forms. It's been a while, 

17 but at some point I did look. 

18 Q. Well, one that's in there is exercise. Why did you 

19 choose not to put exercise in, especially since you 

20 seem to view it as important to do a little test of 

21 the significance of exercise in a footnote. In 

22 fact, I think the footnote we just looked at. 

23 A. Yeah, I don't recall the specific question about 

24 exercise. I recall there was a single exercise that 

25 I believe asked somebody to choose between one of 
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1 

2 

3 

4 

5 

6 

7 

8 
9 

10 


two alternatives that were a description of 
themselves, something about exercising three times a 
week for more than 20 minutes or something versus — 
it was another descriptor, I don't remember exactly 
what it was. 

But I remember it not being a very — 
well, there being some reservations about the 
question is the first point. 

The second point about the exercise 
variable was there was some concern that it would be 
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11 

12 

13 

14 

15 

16 

17 

18 

19 Q. 

20 
21 
22 

23 

24 

25 


what some call an endogenous variable, that someone 
would certainly answer that question that they don't 
exercise for 30 minutes every three times a week if 
they were, they had lung cancer, for example, or if 
they were otherwise sick with a smoking attributable 
disease. 

So I think those were the two major 
concerns about that variable. 

Well, my question really goes more along the lines 
of some of the other questions I've asked. You have 
exercise there in your NMES data, why wouldn't you 
use it? You would have to agree it is a 
well-recognized health factor, true? 

MR. HAMLIN: Objection; asked and 

answered. 
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1 THE WITNESS: Yeah, I believe I've — 15:07:40 


2 

BY 

MR. SILFEN: 


3 

Q. 

I'm sorry if you already answered that. But why 

15 : 07 : 44 

4 


wouldn't you just put the exercise — 

15:07:48 

5 


MR. SILFEN: I see why you're objecting. 

15:07:50 

6 


Tom. I'm meaning to put another slant on it. 

15:07:52 

7 

BY 

MR. SILFEN: 


8 

Q. 

I understand you're saying there was a concern it 

15:07:54 

9 


might be endogenous, but I'm not sure why you 

15:08:02 

10 


wouldn't put the variable in to see if it would 

15:08:06 

11 


significantly affect the smoking coefficient? 

15:08:08 

12 


MR. HAMLIN: Objection; asked and 

15:08:12 
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13 


answered. 

15:08:14 

14 

BY 

MR. SILFEN: 


15 

Q. 

It hasn't been asked and answered. I'm asking. 

15:08:16 

16 


notwithstanding the objections, wouldn't it have 

15:08:18 

17 


been the conservative thing to do to put the 

15:08:22 

18 


exercise variable in and see what happens? 

15:08:24 

19 


MR. HAMLIN: Same objection. Objection to 

15:08:26 

20 


form, as well. 

15:08:28 

21 


THE WITNESS: There are a whole set of 

15:08:32 

22 


things that one might do, and we are doing many of 

15:08:36 

23 


them. This is not one that we did for the two 

15:08:40 

24 


reasons I explained. 

15:08: 42 

25 

BY 

MR. SILFEN: 
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1 

Q. 

Diminished health status, are you considering 


15:09:10 

2 


dropping this part of the model? 


15:09:12 

3 

A. 

Am I? 


15:09:14 

4 

Q. 

Yeah. 


15:09:14 

5 

A. 

No. 


15:09:16 

6 

Q. 

The diminished health status model attributes $223 

15:09: 42 

7 


million to smoking. And I believe that one of our 

15:09:54 

8 


experts reported that $131 million of that was 


15:10:00 

9 


attributable to the pubsmk coefficient for males 

19 

15:10:06 

10 


to 34 . 


15:10:06 

11 


Were you aware of that? 


15:10:08 

12 

A. 

No. 


15:10:08 

13 

Q. 

Well, Dr. Miller was aware of that and said that 

was 

15:10:18 

14 


one of the things that your group was looking into. 

15:10:20 

15 


but you have no knowledge of that? 


15:10:22 
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16 A. Well, we were looking into the — I think a number 

17 of things about the — I'm trying to remember our 

18 discussion. 

19 It gets a little confused between the 

20 different models, so I'm trying to think about what 

21 we discussed about this one. 

22 Q. Think how I feel? 

23 A. Actually, I do recall a discussion about the point 

24 that a large fraction, I don't remember the numbers, 

25 but a large fraction of the expenditures came from 
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1 

2 

3 

4 

5 Q. 

6 

7 

8 
9 

10 A. 

11 Q. 

12 

13 

14 

15 A. 

16 

17 Q. 


one of the groups. 

And that was going to be something we were 
going to follow-up on, and I'm not sure what the 
status of that is right now. 

Okay. Let me briefly run through with you my 
understanding of the diminished health status model 
and see if I have it right. 

If we put aside for a moment the sample 
selection issue, the Mill's inverse ratio. 

Okay. 

I see three basic regressions here. The first is in 
the first the outcome is did a doctor ever tell you 
that you had one of four smoking-related diseases; 
is that correct? 

I believe that's correct for the persons 35 and 
older. 

And in that regression, there are a varied group of 
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19 

20 
21 
22 

23 

24 

25 


Q. 


A. 


covariates and smoke as a main effect. Are you 15 
aware — is that true? 15 
I believe so. I don't have the equation in front of 15 
me. My recollection is certainly smoking is in that 15 
regression. 15 
And there is then a second regression in which the 15 
outcome is self-reported, poor health status? 15 
Correct. 15 


1 Q. 

2 

3 

4 A. 

5 

6 
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8 

9 Q. 
10 
11 
12 

13 A. 

14 

15 

16 

17 Q. 

18 A. 

19 Q. 

20 


186 


And in that regression we use a fitted value from 15 
the first regression plus smoke and other 15 
covariates. 15 
For the 19 to 34-year-olds we used the reported 15 
value of previous disease. For the people 35 and 15 
older, we used the Emperical Bayes estimate of their 15 
underlying variable, given what they told us on the 15 
questionnaire. 15 
And then in a third equation, self-reported poor 15 
health status becomes a variable predicting 15 
probability and level of medical expenditure, along 15 
with pubsmk or privsmk, correct? 15 
That's my understanding. It's, again, the same sort 15 
of — it's the predicted value based upon both the 15 
regression from the previous step, but also on their 15 
reported value. 15 
For the self-reported poor health status? 15 
Yes. 15 
What is the interpretation of the pubsmk coefficient 15 
in that final regression? 15 
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21 

A. 

It's the — let's see, that final regression. 

15:14:36 

22 

Q. 

The outcome would be medical expense? 

15:14:40 

23 

A. 

Expenditure, yeah. 

15:14:42 

24 

Q. 

Self-reported poor health status would be one 

15:14:46 

25 


variable and pubsmk another? 

15:14:48 
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A. Yeah, so pubsmk would be the effect of smoking on 
public or for expenditures from the public domain, 
having controlled for the person's, our best 
estimate, of their health status, poor health 
status, as well as the other factors. 

Q. Why is the variable here pubsmk, but in the earlier 
regressions it's smoke? 

A. I think the decision was that because, again, of 

issues related to demand and access to allow for the 
possibility of a differential effect of persons 
receiving public versus private insurance. 

Whereas, the earlier equations are, again, 
more directed at the biologic process for which that 
was not considered as important, given the other 
factors in the model. 

Q. Why this structure? Why — in this structure we 

don't even have a direct input of smoking to medical 
expenditure, why? 

MR. HAMLIN: Objection; form. 

THE WITNESS: I don't understand your 

question. 

BY MR. SILFEN: 
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24 
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2 A 
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4 A 

5 Q 
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We specified a model here where we don't even have a 15:16:44 
direct smoking factor as a predictor of medical 15:16:48 
expense, it goes through self-reported health 15:16:50 


188 


status. Why? 

I don't agree. 

You don't agree? 

No. 

I take it you don't agree because of the pubsmk 
variable? 

Correct. 

And the pubsmk variable you've explained as having 
an interpretation of a demand function above your 
perceived health status, correct? 

This model allows for smoking to work in a couple of 
ways, either through self-reported poor health or 
directly to expenditures. 

Yeah, but, once again. Doctor, the outcome you're 
predicting is medical expenditures, correct? 

Correct. 

Why wouldn't you just model directly from smoking 
and the other covariates, including insurance, to 
medical expenditure? 

That would be another possibility. 

But why did you do it this way? 

Because this model attempts to describe the process 
by which expenditures might be elevated among 
smokers. That process could reflect two pathways; 
one which is more biologic in nature, and the other 
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1 


is having more to do with their use of medical 

15:18:34 

2 


services. 

15:18:36 

3 


So this model is, as all good models. 

15:18:42 

4 


mimicking the potential pathways, not simply being 

15:18:46 

5 


descriptive. 

15:18:48 

6 

Q. 

I take it, then, that if we modeled directly from 

15:18:50 

7 


smoking and the other covariates to medical expense 

15:18:52 

8 


we'd get the same result? 

15:18:54 

9 

A. 

I don't know. I've not done that, so I can't say 

15:18:56 

10 


you'd get the same result if I haven't done it. 

15:19:00 

11 

Q. 

Why wouldn't you? And what would it mean if you 

15:19:02 

12 


didn't get that result? 

15:19:02 

13 

A. 

I don't know. I don't know. It's not something 

15:19:04 

14 


I've done. I can't speculate about what would have 

15:19:06 

15 


happened, and then can't tell you why it would have 

15:19:10 

16 


happened if I don't even know what would have 

15:19:12 

17 


happened. 

15:19:12 

18 

Q. 

Did you ever sit down with someone saying why are we 

15:19:20 

19 


doing three regressions here? Why do we do previous 

15:19:28 

20 


disease and self-reported health status? Why don't 

15:19:32 

21 


we just find out if smoking has a direct effect on 

15:19:36 

22 


medical expenditures? 

15:19:36 

23 

A. 

I did not ask that question, no. Let me also add 

15:19:40 

24 


that you can find out what the effect of smoking is 

15:19:48 

25 


on medical expenditures this way, which is why we 

15:19:50 
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1 did it this way. 

2 Q. Why wouldn't you have a smoke variable in the final 

3 equation and a pubsmk variable, as well? 

4 A. If I understand the question, you're asking why 

5 wouldn't we have smoke plus pubsmk and private 

6 smoke, is that the question? 

7 Q. Yes. 


8 A. 

9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 
21 

22 Q. 

23 

24 

25 


It's not necessary. If you have — if you define a 
variable, which indicates public smoke — and, 
again, I think there's the fraction of time issue 
that people were switching in and out of different 
insurance status. 

But let's just set that aside for the 
moment. If you had one variable for private smoke 
and another for public smoke, what you're doing is 
allowing the effect of smoking to be different for 
those two categories of insurance. 

It's not necessary to have a smoking 
variable — well, in fact, it wouldn't be possible 
if the smoking variable was the sum of the other 
two, it would be colinear. 

Let me go back to the place we started. Let's 
suppose that it's correct that in this model the 
pubsmk coefficient for 19 to 34-year-olds is 
responsible for more than half of the dollars. 
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1 


What would that tell you? 


15:21:58 
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2 A. It doesn't tell me anything in particular. 

3 Q. Well, come on, I mean, what is the interpretation? 

4 You have $231 million of smoking attributable 

5 expense, and let's assume that it is correct that 

6 the 19 to 34-year-old pubsmk coefficient is 

7 responsible for 131 million of it. 

8 What is that 131 million? How would you 


9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 
21 
22 

23 

24 

25 


interpret it? 

MR. HAMLIN: Objection; asked and 
answered, also on basis of form. 

BY MR. SILFEN: 

Q. $131 million of what? 

A. U.S. currency, I assume. 

Q. You know, you're the chief of biostatistics, and 

this is not a silly question. I don't know how to 
interpret that. I'm told that it's true. $131 
million is for 19 to 34-year-old pubsmk 
coefficient. This is your model. 

I don't think it's out of line for me to 
ask how he interprets that. And I didn't mean to be 
facetious. 

How do you interpret that? What does it 

mean? 

MR. HAMLIN: Objection; asked and 
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1 

2 

3 


answered. 15:23:16 

MR. SILFEN: I don't believe it was. 

THE WITNESS: It doesn't mean anything. 15:23:18 
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4 It means that there's an estimate of $131 million 

5 for the category that you named. That's the 

6 estimate of the smoking attributable expenditures 

7 for that category. 


8 BY MR. SILFEN: 

9 Q. Well, but I believe you said something to the effect 

10 that this was the nonbiological effect of smoking. 

11 Does that mean we have $131 million of 

12 nonbiological smoking attributable effect? 

13 A. It means for that category, the estimate of the 

14 smoking attributable expenditures was $131 million, 

15 just the way for some other category it was whatever 

16 it was the other category. 

17 Q. Yes, but this is a very specific category, this is 

18 19 to 34-year-old, people for whom in previous 

19 models you have ruled that they can't have a major 

20 smoking-related disease, right? 

21 A. They were — they were not in either the lung 

22 cancer/COPD or the CHD/stroke categories. 

23 Q. So it's not a major smoking-related disease. You 

24 said it's some kind of demand effect? 

25 A. I named that as one of the things that it might be. 
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1 

2 Q. 

3 

4 

5 

6 


yes. 

How can you account for the fact that you'd have 
$131 million of smoking attributable costs for 19 to 
34-year-olds in this category? 

MR. HAMLIN: Objection; asked and 

answered. 
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7 


MR. SILFEN: Tom, actually you may 

be 

15:24:36 

8 


right. It is true I've asked it, but it seems to me 

15:24:40 

9 


I'm looking for some — I'm looking for one 

of the 

15:24:44 

10 


creators of the model to give me some reason 

why 

15:24:46 

11 


that's a logical result. And if I've had his 

15:24:48 

12 


answer, then that's fine. 


15:24:50 

13 


MR. HAMLIN: I mean, if you want me to get 

15:24:54 

14 


into a speaking objection, I will. I think 

he has 

15:24:56 

15 


answered the question. 


15:24:58 

16 


MR. SILFEN: Okay. If that's the 

answer, 

15:25:04 

17 


that's it. 


15:25:04 

18 

BY 

MR. SILFEN: 



19 

Q. 

Do you know why there's two Mill's inverse ratios in 

15:25:32 

20 


the diminished health status model? 


15:25:34 

21 

A. 

In which equation? 


15:25:40 

22 

Q. 

I don't know. All the equations, isn't it? 

Well, 

15:25:48 

23 


I'm not sure. Are there two in some of the 


15:25:50 

24 


equations? 


15:25:52 

25 

A. 

Yes . 


15:25:52 
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1 Q. In which equations are there two? 

2 A. In the expenditure equation there's two. 

3 Q. That's the one I'm referring to you as the third 

4 where you're predicting medical expenditure? 

5 A. To the best of my recollection, there are two. 

6 Q. I try not to think about the Mill's inverse ratio 

7 very much, but why don't you tell us quickly why 

8 there are two, if you know? 
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9 A. 
10 
11 
12 

13 

14 

15 

16 

17 

18 

19 

20 
21 
22 

23 

24 

25 


You have to think of the hypothesized underlying 
variables which describe your tendency to have poor 
health, et cetera. 

In this particular problem, we imagine 
that there's such a variable for disease, as well as 
for the sample selection process. 

And so you want to calculate things like 
the — in these models like the expectation of what 
you report for your health status, for example, 
given that we know you're not currently treated and 
given that we know that you were in the sample. 

And so this model actually posits a 
statistical amount for the joint distribution of the 
three underlying variables, the health status, 
tendency to be in the sample, and tendency to be 
currently treated. 

So the equation that's specified for, for 
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1 

2 

3 

4 

5 

6 

7 

8 
9 

10 

11 


example, the poor health model is given that you are 
not currently treated and given that you are in the 
sample. 

And so that conditioning — actually, 
these Mill's ratios are obtained by actually writing 
down a model for the underlying variables and then 
sort of integrating out some stuff, and they pop 
into the equation because of formulas having to do 
with those integrals. 

So it's basically a selection process 
that's gone on both in terms of being in the sample 
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12 

13 

14 Q. 

15 A. 

16 Q. 

17 

18 

19 

20 

21 A. 

22 

23 

24 

25 


and in terms of not being in the currently treated 
group. 

I understand the — 

He'll ask tomorrow. 

I understand the sample bias issue because I had to 
deal with it for a couple years. 

The suggestion here is that there could be 
a bias in the sample because you are a person not 
currently treated, is that the fundamental notion? 

I wouldn't put it that way, no. I know in the one 
case we have a not being in the sample and now not 
being currently treated. 

I think an easier way to think about it is 
that there are three underlying variables, your 
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tendency to be currently treated, your tendency to 
be in the sample, and your tendency to have poor 
health. 

And the model that has been written down 
is for the joint distribution of those three 
underlying variables. 

Now when we go to estimate the parameters 
in one of those models, the poor health model, we 
know something about the other two. 

So the parameters which have been 
formulated in terms of this underlying three 
variables, we can't ignore the information we know. 

The information we know is that you are in 
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14 

15 

16 

17 

18 

19 

20 Q. 

21 
22 

23 

24 

25 


the sample and you don't have a currently treated 
disease. So when you say, okay, now I want to model 
the expected poor health, given that I know those 
two pieces of information, just write down the 
equations and it turns out that you end up with the 
Mill's ratio one for each of them. 

It's not your fault. Tell you what, let's take a 
few minutes and then we'll go into a final stretch. 

THE VIDEOGRAPHER: Temporarily going off 
the video record. The time is now 3:30 p.m. 

(A break was taken.) 

THE VIDEOGRAPHER: Back on the record. 
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The time is now 3:42 p.m. 

BY MR. SILFEN: 

Q. Tell you what, we'll try a different procedure 

here. You just tell me how you did the nursing home 
model. 

A. Let's see, the nursing home model actually uses the 
same approach as everything else we've done, which 
is to say we take total expenditures each year, 
which in this case were Medicaid expenditures, and 
we calculate a smoking attributable fraction. 

And we use not the NMES but the NHANES, 
the follow-up study of the NHANES study, to get — 
for the purpose of estimating the smoking 
attributable fraction. 

And we do that by taking all of the people 
— this period of follow-up we use is 1982 to 1992, 
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17 

18 

19 

20 
21 
22 

23 

24 

25 


an 11-year period, and we take everybody in NHANES 
who during that period turned 55 years old. 

I think there are approximately 1,000 such 
people, maybe nine hundred some people, about 1,050 
or so periods of time they are available for 
follow-up. 

And what we do is we construct from the 
records, from the follow-up survey of NHANES, the 
period of time we know where they are and we know 


15:43:34 
15:43:44 
15:43:46 
15:43:52 
15:43:56 
15:43:58 
15:44:02 
15:44:08 
15:44:12 


198 


1 

2 

3 

4 

5 

6 

7 

8 
9 

10 

11 

12 

13 

14 

15 

16 

17 

18 


whether they're in a nursing home or out of a 
nursing home. 

And then what we do is we stratify by 
gender and by one-year age classes. So, for 
example, we take all the 70 or all the persons who 
are 70 years old during this ten-year period of 
follow-up, and during their 70th year we calculate 
the fraction of time that they were in a nursing 
home, which is the number of days in a nursing home 
divided by the total number of days they were in our 
follow-up. That's an important point. 

If we only knew that they — where they 
were for 180 days, and they were in a nursing home 
for 90 of those days, that would be a ratio of a 
half. And then we average those ratios to get a 
fraction of time in the nursing home. 

And when I said we stratify, we stratify 
on gender, year, age, class and smoking, of course. 
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as well. 


15:45:18 


20 And then for each age and gender, we take 15:45:22 

21 the fraction of time for the smokers minus the 15:45:28 

22 fraction of time for the nonsmokers divided by the 15:45:30 

23 fraction of time for the smokers, and that's the SAF 15:45:36 

24 which we apply to the expenditures for persons in 15:45:40 

25 that age category and in that gender. 15:45:42 


1 Q. 

2 

3 A. 

4 Q. 

5 

6 A. 

7 Q. 

8 A. 

9 Q. 
10 

11 

12 

13 

14 

15 

16 A. 


17 


18 


19 


20 


21 
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I take it that in your example people could be in 
their 70th year in different calendar years? 

Yes . 

And there are no factors controlled for here, other 
than age, gender, smoking? 

Yes, correct. 

There are other factors in NHANES, do you know? 

I assume that there are. 

And in the case of lung cancer in the refined model 
you explained to me why you didn't control for any 
other factors. 

I think it was. I'll paraphrase, but 
because according to Dr. Samet the relationship 
between smoking and those diseases was so strong. 
What's the reason here? 

In our discussions, it was, we thought, the most 
important factor to admission, dominant factor to 
admission, to a nursing home is age. And so we 
stratified on that. And gender, we stratified on 
that. 

There was limited numbers of persons 
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22 

23 

24 

25 


available with the nursing home, you know, who 
qualified for this follow-up study. 

And I think because of the limited, I say 
limited, because of the roughly a thousand persons 
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4 

5 Q. 

6 

7 

8 
9 

10 

11 

12 

13 

14 A. 

15 

16 

17 

18 

19 Q. 

20 

21 A. 

22 
23 


or so, I think people felt it was better to fit a 
simpler model than a more complex one. 

There were some — well, those were the 
main reasons. 

Dr. Miller told us, I think, that — it was not 
inconsistent with what you're saying, but he added 
that you ran this many different ways and got 
greatly variable results depending on the 
specification, and that as a result you decided on a 
very simple model. 

Now, I don't warrant that's exactly what 
he said and we can look at that. But does that ring 
any bells as to other factors, other reasons? 

I don't recall whether — he tried one or two 
approaches to this problem. I think it was perhaps 
as I was coming into this and getting involved. 

I don't recall whether the discussion was 
that different variables that controlled for made — 
He just said different specification got very 
different results. 

Like I say, I don't know specifically what it was 
that led to the conclusion. When I got involved in 
the discussions of the nursing home, it was, you 


15:47:46 
15:47:48 
15:47:54 
15:47:54 
15:48:06 
15:48:08 
15:48:14 
15:48:18 
15:48:24 
15:48:26 
15:48:28 
15:48:30 
15:48:32 
15:48:40 
15:48:44 
15:48:48 
15:48:52 
15:48:58 
15:49:00 
15:49:02 
15:49:04 
15:49:08 
15:49:10 


http://legacy.library.ucsf.@3iJUijft^i^c|(i6§a0QfipBif.industrydocuments.ucsf.edu/docs/rqfl0001 



24 


know, we don't have a lot of data here, we should 


15:49:12 


25 try to do something that's simple and direct, as 15:49:14 
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1 opposed to trying to build a complex model, a more 15:49:18 

2 complex model like was done for the NMES data where 15:49:22 

3 we had considerably more information. 15:49:30 

4 Q. Now, at the beginning of the day, I think, or 15:49:38 

5 earlier in the day we read together from page 4 of 15:49:56 

6 your report where you said that — I'm going to just 15:50:04 

7 omit the word Blue Cross here from my reading — 15:50:08 

8 "The state paid substantially more nursing home 15:50:10 

9 residence fees for smokers than for never-smokers." 15:50:14 


10 


Do you see that? 

15:50:14 

11 

A. 

Yes. 

15:50:14 

12 

Q. 

And, in fact, what you calculated was $238 million; 

15:50:26 

13 


is that correct? 

15:50:26 

14 

A. 

I don't remember the exact number. I'm sure it's 

15:50:30 

15 


here someplace. 

15:50:30 

16 

Q. 

Yeah, it is in paragraph 7, actually. 

15:50:34 

17 

A. 

Where in paragraph 7, sir? 

15:50:42 

18 

Q. 

Last sentence. 

15:50:42 

19 

A. 

Two hundred thirty-eight, yes, point four million. 

15:50:44 

20 

Q. 

So I take it that is the substantially more? 

15:50:50 

21 

A. 

That's our estimate of it, yes. 

15:50:52 

22 

Q. 

More than what? 

15:50:54 

23 

A. 

We estimate that — I'm sorry, maybe I was confused 

15:51:02 

24 


in my last answer. We estimate that of the total 

15:51:06 

25 


expenditures by Medicaid for maintenance of persons 

15:51:12 
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1 


in nursing homes during this period 1978 to 1996, 

15:51:18 

2 


238.4 million of those expenditures are attributable 

15:51:22 

3 


to smoking. 

15:51:22 

4 


And that's 238 million out of the total 

15:51:28 

5 


expenditures, which I don't have in my copy but 

15:51:30 

6 


which are in the report. 

15:51:30 

7 

Q. 

Well, I'm referring back to the first sentence in 

15:51:36 

8 


paragraph 4 now where it says, "The state paid 

15:51:38 

9 


substantially more nursing home residence fees for 

15:51: 42 

10 


nonsmokers than for never-smokers." Is it 238 

15:51:48 

11 


million more? 

15:51:48 

12 

A. 

No. 

15:51:48 

13 

Q. 

How much more? 

15:51: 52 

14 

A. 

I don't know. 

15:51:54 

15 

Q. 

You mean there is not a correspondence between that 

15:51:58 

16 


sentence and the 238 million? 

15:52:02 

17 

A. 

I don't know what the direct correspondence is. 

15:52:14 

18 

Q. 

Do you know one way or another whether — do you 

15:52:20 

19 


know one way or another? 

15:52:20 

20 

A. 

What? 

15:52:22 

21 

Q. 

Why do you say that it's not 238 million more? 

15:52:26 

22 

A. 

Because — well, what 238 million is is the fraction 

15:52:30 

23 


of the actual expenditures by the state, which based 

15:52:36 

24 


upon our model built in NHANES, is attributable to 

15:52:40 

25 


smoking. 

15:52:42 
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1 

2 

3 

4 

5 

6 

7 Q. 

8 
9 

10 A. 

11 
12 

13 

14 

15 Q. 

16 

17 

18 A. 

19 Q. 

20 
21 

22 A. 

23 

24 

25 


I don't think, as I sit here, that that's 
the same as saying that 238 million more was paid 
for residence fees of smokers than for 
never-smokers. 

I don't think those two are equivalent 
statements. 

Well, do you think that the state paid substantially 
more nursing home residence fees for smokers than 
for never-smoker? 

That is the — what it says in the expert report. 

And I, at this moment, don't recall exactly what the 
basis — I think the basis of that sentence is the 
results of the modeling, but the number is not 238 
million. 

But if it's the result of the modeling, what result 
of the modeling is it? What other result do we 
have? 

I don't know. 

What is the difference between the two — between 
the two statements, the last sentence of 7 and the 
first sentence of 4? 

Well, the difference is that the last sentence in 
paragraph 7 says that we estimate that of the total 
expenditures by the state for this period of time, 
1976 — excuse me, for this period of time that 
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1 we're considering, of the total expenditures, 238.4 15:54:28 

2 million of them are attributable to smoking. 15:54:34 
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3 


And I've described for you the logic by 


15:54:36 


4 which we have arrived at that number. The first 15:54:42 

5 sentence in paragraph 4 says something about 15:54:46 

6 residence fees paid for by the state and the amount 15:54:52 

7 of fees paid for by smokers and never-smokers. 15:54:56 

8 And those are not — I don't think at this 15:55:00 

9 point in time those are equivalent statements. And 15:55:02 


10 I can't recall further what the basis for the 

11 sentence at the top of the paragraph — 

12 Q. Your concern is not residence fees, is it? 

13 A. No, my concern is not residence fees. 

14 Q. Okay. 

15 A. I may be missing something by the lateness of the 

16 hour, but as I sit here I do not think that they're 

17 equivalent. And I don't recall further the basis 

18 for the first sentence in paragraph 4. 

19 Q. Well, is it possible that the state did not pay more 

20 nursing home fees for smokers than never-smokers? 

21 A. I don't know. I don't know right now. 

22 Q. So this statement may just be flat wrong? 

23 A. I don't know. I don't think it's wrong, but I don't 

24 know. 

25 Q. Is it possible that the state did not pay more 
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1 nursing residence fees for smokers than for 

2 never-smokers, and yet there are $238 million of 

3 smoking attributable costs? 

4 A. I don't know. 



5 Q. I'm not asking whether that's true. I'm asking if 

6 that's possible? As a statistician, can that be? 

7 A. I don't know. 

8 Q. I take it that by saying you don't know that you 

9 don't rule it out. How could that be? 

10 A. I'm not ruling it in or I'm not ruling it out. I 

11 don't know at this moment. 

12 Q. Well, let's ask it another way. You say there's 238 

13 million of smoking attributable nursing home 

14 maintenance care, correct? 

15 A. Correct. 

16 Q. And I take it that that means that if there hadn't 

17 been any smoking, that there would have been $238 

18 million less for nursing home maintenance care? 

19 A. No. 

20 Q. So there's $238 million attributable to smoking, but 

21 there would not be $238 million less if there had 

22 not been smoking, correct? 

23 MR. HAMLIN: Objection; asked and 

24 answered. 

25 BY MR. SILFEN: 
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1 Q. Is that your position? 

2 A. That's what I said, yes. Excuse me, would you 

3 repeat that question? 

4 Q. You see; apparently, it wasn't asked and answered. 

5 A. I think it was asked and answered, but I just want 

6 to make sure you didn't change the question because 

7 I didn't hear it exactly right. 
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9 


(The requested portion read back.) 

THE WITNESS: There are $238 million 

10 attributable to smoking. That is not the same as 

11 saying that there would have been $238 million less 

12 had there been no smoking. 

13 BY MR. SILFEN: 

14 Q. Why? 

15 A. Because I don't know what would have happened in the 

16 world had there been no smoking. It would require 

17 lots of speculation about things that might have 

18 happened, which I'm not in a position to do. 

19 Q. So your model does not tell us whether there would 

20 have been more or less nursing home costs for the 

21 state had there been no smoking, correct? 

22 A. The model tells us the fraction of the actual 

23 expenditures that did occur that are attributable to 

24 smoking, which is not the same as the — well, 

25 that's what it tells us. 
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1 Q. For purposes of the record, I'd like an answer to 

2 the question, and I'm not meaning to press you on 

3 it. 

4 But is it correct, then, that your model 

5 does not tell us one way or another whether there 

6 would have been more or less nursing home costs to 

7 the state had there been no smoking? 

8 MR. HAMLIN: Objection; asked and 

9 answered. 
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10 THE WITNESS: Our model doesn't address 

11 that question. 

12 BY MR. SILFEN: 

13 Q. Does your model tell us whether smokers or 

14 nonsmokers in the period under study were more 

15 likely to enter nursing homes in their lives? 

16 A. It does not address that question specifically. 

17 Q. Does the model tell us if smokers or nonsmokers were 

18 likely to spend more time in nursing homes in their 

19 lives? 

20 A. It does not address that question specifically, 

21 either. 

22 Q. Does the state tell us — does the model tell us if 

23 the state spent more on smokers than on nonsmokers 

24 for nursing home care? 

25 A. It does not address that difference directly. 
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1 

2 Q. 

3 

4 A. 

5 

6 

7 

8 
9 

10 

11 

12 


either. 15:5 

As far as you know, is it entirely possible that 16:0 

smokers, in fact, spend less time in nursing homes? 16:0 
Well, this model looks at the fraction of time at a 16:0 

given age that smokers are in nursing homes relative 16:0 

to nonsmokers. 16:0 

And by virtue of the fact that this is a 16:0 

positive $238 million means that when averaged over 16:0 
all of the people for whom we had expenditures, that 16:0 
there's at a given age, gender, there's an average 16:0 

more time spent in nursing homes by, a larger 16:0 

fraction of time by the smokers than by the 16:0 
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13 

14 Q. 

15 

16 

17 A. 

18 Q. 

19 

20 
21 

22 A. 

23 

24 

25 


nonsmokers at a given age. 

But smokers are not — you don't know one way or 
another whether smokers are more likely to enter a 
nursing home in their lives? 

This calculation does not address that directly. 
Suppose the state said to you, "Dr. Zeger, we paid 
you a bit of money here to do work for us, we want 
to know will we have more or less nursing home costs 
if there is no smoking," what would your answer be? 
Well, they've not paid me any money to address that 
question. And if they wanted me to address that 
question, it would be a very expensive adventure. 

I mean, I'd have to start and address a 
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1 

2 

3 Q. 

4 

5 

6 A. 

7 

8 
9 

10 

11 

12 Q. 

13 

14 
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different question. It's not the one I've addressed 
here or we've addressed here. 

And how was it decided that you would not address 
any of those questions that we have gone through 
here? 

We addressed the same question here that we 
addressed in all the other parts of the analysis 
that we've talked about today. 

We said what would be the actual 
expenditures for the state and what fraction of 
those are attributable to smoking. 

When you compared the smoker with the nonsmoker on 
an annual basis, you took the fractional part of a 
year spent in a nursing home by a smoker and you 
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15 

16 

17 

18 

19 

20 
21 
22 

23 

24 

25 


A. 

Q. 

A. 

Q. 


subtracted the fractional part of a year spent in 
the nursing home by a nonsmoker, correct? 

We took all of the smokers and all of the nonsmokers 
in a given stratum, yes. 

I didn't mean to make it personal. 

Yeah, compared the difference in the fraction of 
time they were in the nursing home. 

So you compared, for the stratum you were looking 
at, you compared a smoker with a never-smoker, 
correct? 

No. 
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1 Q. 

2 

3 

4 

5 A. 

6 

7 

8 
9 

10 

11 Q. 

12 

13 A. 

14 

15 

16 Q. 

17 


You compared a smoker time in a nursing home for the 
given age group to the nonsmoker or the never-smoker 
time in a nursing home for a given age group, 
correct? 

For a given age and gender we took all of the 
smokers and all of the nonsmokers and we took the 
average fraction of days for the smokers and 
compared that to the average fraction of days for 
the nonsmokers, never-smokers, in that age and 
gender category. 

And you assumed that the difference between those 
two was an extra attributable to smoking, right? 

We assumed that the difference between those two, as 
a ratio to the bigger, is the fraction of additional 
time that's attributable to smoking. 

So what you've said is that — you've calculated, I 
take it, on an annual basis, the cost that the state 
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18 

19 

20 

21 A 

22 Q 

23 

24 

25 A 


1 Q 

2 

3 

4 A 

5 Q 

6 

7 

8 A 

9 

10 Q 

11 

12 A 

13 Q 

14 

15 

16 A 

17 

18 Q 

19 A 


would have had or would have saved if smokers had 
had the same nursing home stay as nonsmokers on 
average for that year? 

No. 

Oh, well, I — let's take one year. Let's say there 
was only one year in the study, people 70 years 
old. Okay? 

Okay. 
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For those people 70 years old, let's say the average 
nursing home stay was ten days for smokers and five 
days for nonsmokers. Okay? 

Okay. 

That's the average over all of the people in that 
stratum. So the excess days that you would say were 
attributable to smoking is five, correct? 

We calculated it in a fraction, an excess fraction 
of days. 

I understand. I could put in the denominator and 
the denominator would be 10. 

The denominator would be 10 in a year? 

Well, what you would — what the denominator would 
be is the days that the smoker spent, the average 
days that the smoker spent in the nursing home? 
That's not what we did. I mean, I don't mean to be 
argumentative, but that's not what we did. 

That's fine, then tell me what you did. 

What we did is we took for each person the fraction 
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20 

21 

22 

23 

24 

25 


of time they were in. So if the person was there in 1 
our study for a whole year, and they were in for 180 1 


days, their fraction was a half, I mean, roughly a 1 

half. 1 

If somebody was only in our study for two 1 

days and they were in the nursing home for one day, 1 
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10 Q. 

11 
12 

13 A. 

14 

15 

16 

17 Q. 

18 

19 

20 

21 A. 

22 Q. 


that was also a half. So you can't just take the 1 

number of days. You have to look at the 1 

denominator, as well, which is the total days they 1 

were at risk of being in a nursing home. 1 

And we calculated the average fraction for 1 
the smokers and compared that to the average 1 

fraction for the nonsmokers, which is different from 1 
taking the numbers of days for the smokers and 1 

subtracting the number of days for nonsmokers. 1 

And the difference between those two fractions you 1 

said was the excess nursing home time due to 1 

smoking? 1 

We said it was the fraction, that thing divided by 1 


the average fraction of time for the smokers was the 1 


smoking attributable fraction that we applied to the 1 
expenditures for nursing home stays. 1 
And so for each year you were charging smoking with 1 
responsibility for any difference between the 1 
fraction of time spent by a smoker, the smokers 1 
population wide, and nonsmokers? 1 
Correct. 1 
And why is that extra, that difference which we've 1 
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23 


just identified, not the difference for that year 


16:08:42 


24 that is attributable to people smoking? 16:08:48 

25 A. I don't understand that question. 16:08:54 
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1 MR. HAMLIN: Objection. 

2 BY MR. SILFEN: 


3 

Q 

4 


5 

A 

6 

Q 
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8 

A 

9 

Q 

10 


11 

A 

12 

Q 

13 


14 


15 


16 

A 

17 

Q 

18 

A 

19 


20 


21 


22 


23 


24 



Is that difference a difference that you say is 
attributable to smoking? 

Yes, and it's for a year of age. 

Is that something different than saying that it is 
attributable to the fact that people smoke? 

No. Similar; different words. 

And then is it not also true that that difference 
would disappear if people did not smoke? 

That is not necessarily true. 

So you say that is a difference that is attributable 
to people, to the fact that people smoked, but that 
difference would not disappear if people did not 
smoke? 

That's correct. 

How can that be? 

Well, there are lots of factors that might change if 
there was no smoking in the world. And in order to 
make this comparison, I have to have a certain 
population of people, and I have to have smokers and 
nonsmokers, and you're talking about a different 
world. 

I don't know that the data would look the 
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way it does if we had a different world without 


16: 
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smoking. So I can't tell you that they're the 
same. 

Q. And why didn't you look at those other factors that 
you described that would have changed? 

A. Well, because I can only speculate about what might 
have changed. And it wasn't relevant to the 
question that I addressed in this report. 

Q. Well, which is it or both? Is it that it's not 
relevant or that it was speculative? 

A. It's both. It's both. The question — I was able 

to address the question, we were able to address the 
question that was asked, which had to do with 
attributing the actual expenditures without having 
to speculate about how things might have been in 
another world, and that's why we approached the 
problem the way we did. 

MR. SILFEN: Why don't we take five 

minutes . 

(A break was taken.) 

THE VIDEOGRAPHER: We're back on the video 
record. This is the fourth tape of the video 
deposition of Scott Zeger. The time is now 4:16 

p ,m. 

BY MR. SILFEN: 

Q. Back on the record. Having given you an early 
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1 


warning, you do know what a life table is? 

16:16:46 

2 

A. 

Yes . 

16:16:46 

3 

Q. 

Are life tables speculative? 

16:16:48 

4 

A. 

They can be. 

16:16:50 

5 

Q. 

Are they any more speculative than the data in 

16:16:54 

6 


NHANES or in NMES? 

16:16:58 

7 

A. 

A life table can be filled with speculative numbers. 

16:17:02 

8 


or it can be filled with numbers estimated 

16:17:06 

9 


accurately from data. 

16:17:06 

10 

Q. 

Are there reliable life tables for smokers and 

16:17:10 

11 


nonsmokers? Are you familiar with ACS million 

16:17:14 

12 


person study, for instance? 

16:17:16 

13 

A. 

Yes, I am. 

16:17:16 

14 

Q. 

Do you believe that there are reliable life tables 

16:17:18 

15 


for smokers and nonsmokers? 


16 

A. 

I don't know the extent to which there are reliable 

16:17:22 

17 


life table studies. I'm somewhat familiar with that 

16:17:24 

18 


study, but I don't know about the life tables. 

16:17:26 

19 

Q. 

I was just picking a study out of the air. But is 

16:17:28 

20 


it your position that there are not reliable life 

16:17:32 

21 


tables for smokers and nonsmokers? 

16:17:34 

22 

A. 

It's not my position that there are or that there 

16:17:36 

23 


aren't. 

16:17:36 

24 

Q. 

You don't know one way or the other? 

16:17:38 

25 

A. 

I don't know about the life table data on smokers or 

16:17:42 
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1 


nonsmokers. 


16:17:42 


2 Q. If there are reliable life table information for 16:17:46 

3 smokers and nonsmokers, say as reliable as NMES and 16:17:50 

4 NHANES, then why would the life expectancy 16:17:54 

5 difference between smokers and nonsmokers be 16:17:58 

6 speculative? 16:17:58 

7 MR. HAMLIN: Objection to foundation. 16:18:00 

8 THE WITNESS: I didn't say that the, 16:18:02 

9 conditioned on the premise, that there are accurate 16:18:10 


10 life tables for smokers and nonsmokers. I did not 

11 say that the life tables had to be speculative or 

12 using the life tables had to be speculative. 

13 I used the word speculative in response to 

14 a particular question you asked me previously. 

15 BY MR. SILFEN: 

16 Q. Well, all right. Let's assume that if people didn't 

17 smoke, one of the things that would change would be 

18 their life expectancy. 

19 Do you believe that to be true? 

20 A. I do believe that, that's true, yes. 

21 Q. Do you think the change in their life expectancy is 

22 speculative? 

23 A. What? 

24 Q. If they didn't smoke. 

25 A. If they didn't smoke, I would — I mean, from what 
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1 I've read, and I've not studied this topic, but from 

2 what I've read I would expect on average people who 

3 don't smoke live longer than people who do smoke. 



4 Q. 


Do you have any reason to believe that the data 


16:19:00 


5 indicating the life expectancy difference between 16:19:04 

6 smokers and nonsmokers is any more or less reliable 16:19:08 

7 than the data indicating the disease incidence 16:19:16 

8 difference between smokers and nonsmokers? 16:19:16 

9 MR. HAMLIN: Objection to form and 16:19:16 


10 foundation. 

11 THE WITNESS: I'm sorry, you'll have to 

12 ask it again. 

13 BY MR. SILFEN: 

14 Q. Let me ask it in two parts. You have used here NMES 

15 to derive a comparison of smoker and nonsmoker 

16 disease incidence experience, correct? 

17 A. No. 

18 Q. All right. You've used NMES to compare smoker and 

19 nonsmoker rates of disease, correct? 

20 A. For the prevalence rates, correct. 

21 Q. Do you believe that the studies comparing smoker and 

22 nonsmoker expected life span are any less reliable 

23 than the data you used in NMES to compare smoker to 

24 nonsmoker? 

25 A. I have no basis to know one way or the other. 
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1 Q. Let's assume that that data is as reliable, then why 

2 would it be speculative when we compare smokers to 

3 nonsmokers to also hypothesize what the difference 

4 would be in their life-span if smokers didn't 

5 smoke? 



7 


MR. HAMLIN: Objection to form, 
foundation, assuming facts not in evidence. 

8 THE WITNESS: I'm not sure I understand 

9 the relevance of the question to what — 

10 BY MR. SILFEN: 

11 Q. Well, then I will tell you the relevance. You said 

12 that the reason you didn't consider another world, a 

13 different world, is because it would be 

14 speculative. That was one of your reasons, right? 

15 A. I said there were two reasons why we did what we 

16 did. 

17 Q. I'm addressing one of them now. You said one of 

18 them was that it was speculative, right? 

19 A. I said it would be more speculative, yes. 

20 Q. And I'm addressing that now. And my first point was 

21 that you used NMES to compare smokers and 

22 nonsmokers, correct? That's the point I just made. 

23 Okay. 

24 And then I asked you whether the life 

25 table data on smokers and nonsmokers was any more 
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1 

2 

3 A. 

4 Q. 

5 

6 

7 

8 


speculative than the data in NMES you relied on, and 
you said you didn't know? 

That's correct, that's what I said. 

So do you have any reason to believe that it would 
be more speculative to hypothesize what would happen 
to life-span if smokers didn't smoke than it would 
what would happen to disease rates? 

MR. HAMLIN: Objection to form and 
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foundation. 


16:21:32 

10 


THE WITNESS: I mean, the question is. 


16:21:34 

11 


given that there's no difference in the precision 

16:21:36 

12 


between the NMES data and this life table data. 

is 

16:21:40 

13 


there any difference in the precision, is there 

any 

16:21:42 

14 


more speculation? 


16:21:44 

15 

BY 

MR. SILFEN: 



16 

Q. 

I understand, it's a tautological question. 


16:21:48 

17 

A. 

You know, what's the purpose of — if A is true. 


16:21:54 

18 


then A is true. 


16:21:54 

19 

Q. 

You told me that age is the principal determinant of 

16:21:58 

20 


nursing home entry, didn't you, you said that? 


16:21:58 

21 

A. 

I did not say that, no. 


16:21:58 

22 

Q. 

You said age is a main determinant, right? 


16:22:02 

23 

A. 

I did not say that, no. I said age is a main 


16:22:04 

24 


determinant of the probability somebody is in a 


16:22:06 

25 


nursing home. I did not say — I was talking about 

16:22:08 


220 


1 

prevalence not incidence. 



16:22:10 

2 Q. 

Well, okay. I'll take it. Well, 

we can — 

do you 

16:22:18 

3 

want to look for what you said? 

Why don't 

we look 

16:22:20 

4 

for what he said. 



16:22:20 

5 

(The requested portion read 

back.) 


16:22:58 


6 MR. SILFEN: "In our discussions, it was, 

7 we thought, the most important factor to admission, 

8 dominant factor to admission, to a nursing home, is 

9 age." 

10 BY MR. SILFEN: 
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11 

12 

13 

14 

15 

16 

17 

18 

19 

20 
21 
22 

23 

24 

25 


Q. Now, if the most important factor is age and if the 
age difference, expectancy difference between 
smokers and nonsmokers, is not speculative, why 
wouldn't we take that into account? 

A. I'm sorry, repeat the question. 

MR. HAMLIN: Objection; form. 

THE WITNESS: For what purpose? 

BY MR. SILFEN: 

Q. You said that you didn't answer the question of what 
would have happened had people not smoked, in part, 
because it was speculative, because of the other 
factors that might be involved, correct? 

A. What I said was that to address the question that we 
were asked, what fraction of actual expenditures are 
attributable to smoking, it was not necessary to 


16:23:06 

16:23:10 

16:23:14 

16:23:14 

16:23:22 

16:23:42 

16:23:42 

16:23:50 

16:23:54 

16:23:58 

16:24:00 

16:24:06 

16:24:10 

16:24:14 
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1 

2 Q. 

3 

4 

5 

6 A. 

7 

8 
9 

10 

11 Q. 

12 
13 


look at these. Well, I didn't say — what I — 

Dr. Zeger, you also said it was speculative, and 
that's the part I'm addressing. I'm not speaking to 
the nature of your task. I'm speaking to your 
statement that it was speculative. 

What I'm trying to add and say to you now is that 
it's — it was more speculative to have to conjure 
up what might have been than it was to use the data 
which directly addressed the question that we had at 
hand. 

The data you used compared smokers and nonsmokers 
for their rate — their different rate of disease, 
correct? 


16:24:22 

16:24:26 

16:24:28 

16:24:30 

16:24:34 

16:24:36 

16:24:42 

16:24:44 

16:24:48 

16:24:48 

16:24:52 

16:24:58 

16:24:58 
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14 

A. 

For their rate of being in a nursing home. 

16:25:02 

15 

Q. 

Okay. Their rate of being in a nursing home? 

16:25:04 

16 

A. 

Correct. 

16:25:04 

17 

Q. 

And why is it more speculative to address the 

16:25:10 

18 


additional factor of their different life expectancy 

16:25:12 

19 


if they didn't smoke? 

16:25:14 

20 


MR. HAMLIN: Objection to form and 

16:25:16 

21 


foundation. 

16:25:16 

22 


THE WITNESS: The point I made, the 

16:25:18 

23 


comment I made about speculative has to do with 

16:25:22 

24 


trying to figure out how the world might have been 

16:25:24 

25 


if people didn't smoke. 

16:25:26 


222 


1 One of the aspects of that world might 

2 have been their life expectancy and other aspects 

3 might have been other things. 

4 BY MR. SILFEN: 

5 Q. But the point I'm making is that it is not at all 

6 speculative or at least certainly no more 

7 speculative than the data that you use? 

8 A. I disagree with that. 

9 Q. Really? Do you think the data comparing smoker and 

10 nonsmoker disease risk is more or nursing home risk 

11 is more reliable than the data on different — on 

12 life expectancies for smokers and nonsmokers? We 

13 just went through. You said you didn't know. 

14 MR. HAMLIN: Objection; form. I also 

15 object on the grounds the question is 


16:25:28 

16:25:30 

16:25:32 

16:25:34 

16:25:36 

16:25:42 

16:25:42 

16:25:46 

16:25:50 

16:25:52 

16:25:56 

16:25:56 

16:25:58 

16:26:00 
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16 argumentative. 

17 BY MR. SILFEN: 

18 Q. I agree with you it's been asked and answered, he 

19 said he didn't know. Now, do you or do you not 

20 know? 

21 A. What? 

22 Q. Is the data on disease rates, difference between 

23 smokers and nonsmokers, any more reliable that the 

24 life expectancy rates? 

25 MR. HAMLIN: Same objections. 


16:26:00 

16:26:02 

16 : 26:06 

16 : 26:06 

16 : 26:06 

16:26:10 

16:26:14 

16:26:18 

16:26:18 
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1 

2 

3 

4 

5 

6 

7 

8 
9 

10 

11 

12 

13 

14 

15 

16 

17 

18 


THE WITNESS: I answered I don't know and 
I don't know. 

BY MR. SILFEN: 

Q. Do you believe that the NHANES smoker population and 
nonsmoker population that you examined had a 
different age distribution? 

A. I don't know. 

Q. Why wouldn't it be a simple solution to this problem 
to simply leave them with whatever age distribution 
they had? In other words, not age standardized, and 
compare their nursing home days? 

A. Because that fails to control for the most important 
factor for being in a nursing home as opposed to 
admission to a nursing home for prevalence of 
nursing home days, which is age. 

Q. Do you believe that if people stopped smoking that 
age difference between them, if there is one, would 
persist? 


16:26:20 

16:26:20 

16:26:36 

16:26:40 

16:26:44 

16 : 26:46 

16:27:00 

16:27:02 

16:27:10 

16:27:12 

16:27:16 

16:27:18 

16:27:22 

16:27:24 

16:27:26 

16:27:28 

16:27:28 
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19 A. 

20 Q. 

21 
22 

23 

24 

25 


I don't know. 

If any age difference between smokers and nonsmokers 
would go away if they stopped smoking, why would you 
age adjust? What sense does that make? 

MR. HAMLIN: Objection to form and 
foundation, also repetitive. 

THE WITNESS: Because it allows me to 


16:27:30 

16:27:34 

16:27:38 

16:27:40 

16:27:42 

16:27:44 

16:27:46 
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1 


address the question which was asked. 


16:27:48 

2 

BY 

MR. SILFEN: 



3 

Q. 

I see. 


16:27:50 

4 

A. 

Which is what fraction of the dollars actually 


16:27:54 

5 


expended for nursing home are attributable to 


16:27:58 

6 


smoking. 


16:27:58 

7 

Q. 

Suppose that their age expectancy is a function 

of 

16:28:04 

8 


their smoking. Is it then correct or even 


16:28:12 

9 


appropriate to age adjust? 


16:28:14 

10 

A. 

If the question that you're asking is the question 

16:28:18 

11 


that we addressed, yes. 


16:28:18 

12 

Q. 

So basically you were doing what you were told 

and 

16:28:28 

13 


it has nothing to do with speculation or data or 

16:28:30 

14 


anything else; is that right? 


16:28:32 

15 

A. 

I was choosing the best statistical methods to 


16:28:36 

16 


address the question which was asked, which was 

what 

16:28:38 

17 


is the smoking attributable expenditures for 


16:28:40 

18 


smoking? 


16:28:40 

19 

Q. 

Now, one of our experts approached the problem 


16:29:34 

20 


differently, took the smokers and nonsmokers as 

he 

16:29:42 
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21 

22 

23 

24 

25 


found them, and for the ten-year period counted up 16:29:50 

nursing home days for the smokers and nursing home 16:29:54 

days for the nonsmokers. 16:30:00 

And in each case, the denominator was the 16:30:04 
people starting out, so he basically got an average 16:30:08 


1 

2 

3 

4 

5 

6 


7 

A 

8 

Q 

9 


10 


11 


12 


13 


14 

A 

15 

Q 

16 


17 


18 


19 


20 

A 

21 

Q 

22 


23 



225 

nursing home days for the ten-year period for all 
smokers, whether they went in or not, and all 
nonsmokers, whether they went in or not. 

Are you with me? I'm not asking you to 
approve, I'm just asking if you understand what I'm 
saying. 

I'm not 100 percent sure what he did, no. 

He just took, he looked at the whole ten-year 
period, took the smokers, counted up their nursing 
home days, took the nonsmokers, counted up their 
nursing home days, and he averaged those over all 
the smokers who started and all the nonsmokers who 
started. 

Right. 

So what it was, in one I believe that the effect is 
to measure for the ten-year period both probability 
of entry and duration of stay and takes into account 
smoking prevalence, as well? Doesn't it do all 
those things? 

I don't know. 

Okay. Now, his finding was that nonsmokers were 
much more likely to enter nursing homes and stayed 
longer when they entered. Do you have any reason to 


16:30:14 

16:30:18 

16:30:20 

16:30:22 

16:30:24 

16:30:28 

16:30:30 

16:30:32 

16:30:36 

16:30:40 

16:30:44 

16:30:44 

16:30:44 

16:30:50 

16:30:56 

16:31:02 

16:31:04 

16:31:06 

16:31:06 

16:31:16 

16:31:22 

16:31:24 
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24 


doubt that result? 


16:31:26 


25 A. I don't have any reason to believe it nor to doubt 16:31:30 


226 


1 


it. I don't know. 

16:31:30 

2 

Q. 

Okay. What question does that analysis answer? 

16:31:34 

3 


MR. HAMLIN: Objection to form. 

16:31:38 

4 


THE WITNESS: That analysis would address 

16:31:44 

5 


the average numbers of days that a nonsmoker was in 

16:31:56 

6 


a nursing home minus the average numbers of days a 

16:32:02 

7 


smoker was, without paying attention to what's the 

16:32:06 

8 


critical factor in life table or other analysis. 

16:32:10 

9 


which was the time that a person was at risk to be 

16:32:12 

10 


there. 

16:32:14 

11 

BY 

MR. SILFEN: 


12 

Q. 

Well — 

16:32:16 

13 

A. 

I think. 

16:32:20 

14 

Q. 

Well, I don't understand that, but we'll come back 

16:32:24 

15 


to that. 

16:32:24 

16 


Let's mark as the next exhibit the chart 

16:32:40 

17 


done by Tim Wyant that we referred to earlier in 

16:32:48 

18 


connection with a statement in your report. 

16:32:56 

19 


paragraph 4 on page 4. It's the second sentence in 

16:33:14 

20 


that paragraph 4, "smokers entering nursing homes," 


21 


do you see that? 


22 

A. 

Yes. 


23 

Q. 

And you recall we talked about that earlier? 

16:33:18 

24 

A. 

I do recall indicating that I thought Tim had done 

16:33:20 

25 


some calculations related to that. 

16:33:22 
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1 

Q. 

And you said you'd looked at it — 

16:33:24 

2 

A. 

Briefly. 

16:33:26 

3 


(Defendants' Exhibit 2403 marked for 

16:33:44 

4 


identification by the reporter.) 


5 

BY 

MR. SILFEN: 


6 

Q. 

I will give you Exhibit 2403 and ask you if that is 

16:33:50 

7 


the chart that you remember looking at earlier? 

16:33:56 

8 

A. 

I believe so. 

16:34:12 

9 

Q. 

And what do you understand the chart to represent? 

16:34:16 

10 

A. 

I believe this represents the diagnosis of persons 

16:34:28 

11 


entering — upon entrance to a nursing home 

16:34:32 

12 


comparing nonsmokers to smokers. 

16:34:38 

13 


I believe it gives the numbers of persons 

16:34:40 

14 


with different smoking attributable diseases. And I 

16:34:44 

15 


believe it's upon entrance to nursing home, but I'm 

16:34:48 

16 


not 100 percent certain. 

16:34:48 

17 

Q. 

But it was over the entire ten years of the 

16:35:00 

18 


follow-up, I believe, I think from what I recall. 

16:35:06 

19 


Let me back up. 

16:35:10 

20 


The study I described as having been done 

16:35:14 

21 


by a defense expert, is that a longitudinal study of 

16:35:22 

22 


nursing homes? 

16:35:22 

23 

A. 

I think I would call that a longitudinal study. 

16:35:26 

24 

Q. 

And also an incidence study? 

16:35:28 

25 

A. 

No, I don't think I would call it an incidence 

16:35:38 


228 
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1 


study. 

16:35:40 

2 

Q. 

Why not? 

16:35:42 

3 

A. 

Well, you know, you have to be more specific about 

16:35:48 

4 


what he did. 

16:35:48 

5 

Q. 

That's fair. That's fair, why don't we wait. Is 

16:35:58 

6 


this a longitudinal study that Dr. Wyant did? 

16:36:02 

7 

A. 

Well, the data that we used in our calculations were 

16:36:08 

8 


longitudinal data, yes. 

16:36:10 

9 

Q. 

Is this an incidence study that Dr. Wyant did? 

16:36:12 

10 

A. 

No. 

16:36:14 

11 

Q. 

At the bottom you see for nonsmokers the totals. 

16:36:32 

12 


persons with any tobacco related disease and persons 

16:36:36 

13 


with no tobacco related disease, do you see that? 

16:36:40 

14 

A. 

I see those two rows, yes. 

16:36:42 

15 

Q. 

And the total there would be 415 persons, 313 plus 

16:36:50 

16 


102? 

16:36:50 

17 

A. 

For nonsmoking? 

16:36:54 

18 

Q. 

Yes . 

16:36:54 

19 

A. 

415 would be the sum of any tobacco-related diseases 

16:36:58 

20 


and no tobacco-related diseases. 

16:37:00 

21 

Q. 

And for smokers how many people do we have entering 

16:37:06 

22 


nursing homes? 

16:37:06 

23 

A. 

Let's see, by my calculations 259, the sum of 213 

16:37:18 

24 


and 46. 

16:37:20 

25 

Q. 

Now, earlier today we talked about the data that you 

16:37:26 


229 


1 had reported on the prevalence of smoking in 16:37:30 
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2 


Minnesota. Do you remember that? 


16:37:30 


3 

A 

4 

Q 

5 


6 

A 

7 


8 

Q 

9 


10 


11 

A 

12 


13 


14 

Q 

15 


16 


17 


18 


19 


20 


21 

A 

22 

Q 

23 


24 

A 

25 

Q 


I don't recall that discussion, no. 

Well, we looked together at chart 1, Table 1, on 
page 2 of your report. 

Yes, this is for persons 19 and above for the period 
1984 to '94. 

Okay. And I think what you told me that for persons 
19 and above during this period at least 50 percent 
were ever-smokers, right? 

Yes, in these two categories for persons 19 and 
above, more than 50 percent were current or former 
smokers. 

Let's assume that that 50/50 split persists right up 
through the age groups. I don't know that it does, 
but let's assume that that is correct. 

Would Dr. Wyant's paper then tell us that 
nonsmokers entered nursing homes during this NHANES 
period at, what, one and a half times the rate of 
nonsmokers, of smokers? 

I don't know. 

Well, you don't know because we haven't done the 
math or — we can do the math. 

I don't know. 

Well, I'll do the math. No, I won't. I think it's 


16:37:38 

16:37:44 

16:37:46 

16:37:56 

16:37:58 

16:38:00 

16:38:06 

16:38:08 

16:38:14 

16:38:18 

16:38:18 

16:38:24 

16:38:28 

16:38:30 

16:38:38 

16:38:44 

16:38:50 

16:38:52 

16:38:58 

16:39:00 

16:39:02 

16:39:04 

16:39:22 
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1 1.6. Let's assume we had a calculator and we 

2 divided 415 by 259 and we got 1.6. 

3 Would Dr. Wyant's study then tell us that 

4 over the NHANES period that nonsmokers entered 



5 

6 

7 A. 

8 Q. 

9 A. 
10 

11 

12 

13 

14 Q. 

15 

16 

17 

18 

19 

20 
21 

22 A. 

23 Q. 

24 

25 


nursing homes at about a 60 percent greater rate 
than smokers? 

I don't know that that's what I would say. 

Why? What is it that we don't know? 

Well, I don't know specifically what assumptions 
you're making or what calculation that you're 
making. 

You've calculated the ratio 415 to 259 and 
you've said — I don't know what that means. 

Well, I'd like to understand this, so I didn't mean 
to make assumptions. 

My assumptions are this, that Dr. Wyant 
has calculated that during the NHANES follow-up 
period 415 nonsmokers entered nursing homes and 259 
smokers entered nursing homes. 

I don't think that's an assumption. 

That's what the data reflects, right? 

That's my understanding, yes. 

And I have assumed that the 50/50 smoker/nonsmoker 
split for Minnesotans, which we see in your report, 
persists through all age groups. That is the 


16:39:52 

16:39:52 

16:40:00 

16:40:02 

16:40:06 

16:40:08 

16:40:08 

16:40:14 

16:40:16 

16:40:20 

16:40:22 

16:40:26 

16:40:32 

16:40:38 

16:40:42 

16:40:44 

16:40:46 

16:40:48 

16:40:56 

16:41:00 

16:41:04 
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1 assumption that I'm making. Okay? 

2 A. I don't know whether that assumption is even close 

3 to — 


4 

Q. 

Let's assume 

that 

it is . 


5 

A. 

I don't know. 




6 

Q. 

Let's assume 

it' s 

right. 

I want to understand how 


16:41:06 

16:41:08 

16:41:10 

16:41:12 

16:41:14 

16:41:16 
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7 


we do these calculations. 

16:41:18 

8 


Would this study that Dr. Wyant did then 

16:41:24 

9 


demonstrate that nonsmokers entered nursing homes 

16:41:32 

10 


at — well, at least a 50 percent greater rate than 

16:41:36 

11 


smokers? 

16:41:38 

12 


MR. HAMLIN: Objection; asked and 

16:41:40 

13 


answered. 

16:41:40 

14 


THE WITNESS: I wouldn't — I mean, I 

16:41:54 

15 


don't know what it would — why that ratio would be 

16:41:58 

16 


relevant. 

16:41:58 

17 

BY 

MR. SILFEN: 


18 

Q. 

All right. Let's just say at a greater rate. 

16:42:04 

19 

A. 

I can't conclude that. I don't know that that's the 

16:42:06 

20 


case. 

16:42:06 

21 

Q. 

And is the only reason because you don't know the 

16:42:08 

22 


prevalence in the community? 

16:42:10 

23 

A. 

That's one reason. 

16:42:10 

24 

Q. 

What's the other reason? 

16:42:12 

25 

A. 

I don't know who these people are. I don't know 

16:42:18 
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1 their ages. I don't know anything else about them. 

2 I mean, that data may be available in NHANES and 

3 might well be taken into account. 

4 Q. I'm not asking you anything about age. I'm asking 

5 whether the smokers, the nonsmokers in NHANES, 

6 entered nursing homes at a greater rate than the 

7 nonsmokers? 

8 A. Well, if you're asking me whether 415 smokers — 

9 nonsmokers entered and only 259 smokers entered. 


16:42:18 

16:42:24 

16:42:24 

16:42:24 

16:42:26 

16:42:30 

16:42:30 

16:42:36 

16:42:42 
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10 


that is to say if more nonsmokers entered, that's 

16:42:46 

11 


clearly the case in this data. 415 is greater than 

16:42:50 

12 


259; more nonsmokers entered than smokers. 

16:42:58 

13 


If you're asking me to reach some 

16:43:00 

14 


scientific conclusion based upon that fact, I'm not 

16:43:04 

15 


willing to do so. 

16:43:04 

16 

Q. 

Actually, I'm just trying to understand — now, on 

16:43:30 

17 


page 4 of your report, you said that smokers 

16:43:36 

18 


entering nursing homes during this period were far 

16:43:38 

19 


more likely, do you see that, than never-smokers to 

16:43:42 

20 


be suffering from lung cancer and chronic 

16:43:46 

21 


obstructive pulmonary disease, do you see that? 

16:43:48 

22 

A. 

I do see that, yes. 

16:43:50 

23 

Q. 

And do you see that the basis of that statement is. 

16:43:52 

24 


in fact, these reports, this chart? 

16:43:54 

25 

A. 

Let's see, we're talking about lung cancer for which 

16:44:04 


233 


1 


there were 15 persons who had lung cancer 1 

who were 

16:44:12 

2 


smokers and no persons who had lung 

cancer 

were 


16:44:16 

3 


nonsmokers, and then for COPD there 

were 109 


16:44:18 

4 


smokers — sorry, 109 COPD cases among the 

smokers 

16:44:26 

5 


and 51 among the nonsmokers. 




16:44:28 

6 

Q. 

And how many nursing home cases were 

there 

among 

the 

16:44:30 

7 


nonsmokers, 415? 




16:44:36 

8 

A. 

415 total, yes. 




16:44:38 

9 

Q. 

And how many nursing home cases were 

there 

among 

the 

16:44:40 

10 


smokers, 259? 




16:44:44 

11 

A. 

Right. 




16:44:44 
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12 Q. 


So are we then 


have we then reason to conclude 


16:44:48 


13 that during this period nonsmokers were far more 16:44:54 

14 likely than never-smokers to enter nursing homes? 16:44:56 

15 A. No, because — no. 16:45:04 

16 Q. Why not? 16:45:04 

17 A. Because I believe the basis for the sentence is 16:45:08 

18 reflected in the fact that among the persons who 16:45:18 

19 enter here, far more likely — yes, among the 16:45:26 

20 persons entering here, 12 percent had COPD among the 16:45:32 

21 nonsmokers, as opposed to 42 percent among the — 16:45:34 

22 Q. This is just another way of saying we need the 16:45:38 

23 prevalence number. But if we had the prevalence 16:45:40 

24 number and the prevalence number is indeed 50/50, 16:45:42 

25 wouldn't it be true that you could just as reliably 16:45:48 


234 


1 say that nonsmokers are far more likely to enter 

2 nursing homes than smokers, as you could make 

3 conclusions about lung cancer? 

4 MR. HAMLIN: Objection; asked and 

5 answered. 

6 THE WITNESS: No. 

7 BY MR. SILFEN: 

8 Q. Why not? 

9 A. Because my understanding of the sentence in 

10 paragraph 4 is referring to the fact that among 

11 those persons entering a nursing home, 259 smokers 

12 and 415 nonsmokers, 42 percent of the smokers had 

13 COPD, while only 12 percent of the nonsmokers did. 

14 And 6 percent of the smokers had lung cancer, while 


16:45:52 

16:45:54 

16:45:56 

16:45:58 

16:45:58 

16:45:58 

16:45:58 

16:46:02 

16:46:08 

16:46:12 

16:46:20 

16:46:24 

16:46:26 
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15 


none of the nonsmokers did. 

16:46:28 

16 

Q. 

Yeah, but if we assume that — 

16:46:30 

17 

A. 

I believe that's the point here. 

16:46:32 

18 

Q. 

If we assume there's 1,000 nonsmokers in the 

16:46:36 

19 


community and 1,000 smokers, we could just put them 

16:46:38 

20 


over a denominator and get the same result? 

16:46:42 

21 

A. 

No. No, that's not true. 

16:46:44 

22 

Q. 

Why? 

16:46:44 

23 

A. 

By taking that logic, which I've just used, to 

16:46:46 

24 


justify this second sentence in paragraph 4, the 

16:46:50 

25 


comparison is that 100 percent of the smokers went 

16:46:54 


235 


1 


into nursing homes, entered nursing homes, as 

16:46:56 

2 


compared to 100 percent of the nonsmokers, which is 

16:47:00 

3 


the same. 

16:47:00 

4 


So you're asking me whether it's the same 

16:47:02 

5 


thing, and I'm telling you that it's not. 

16:47:04 

6 

Q. 

I'll try once more. I want you to assume that the 

16:47:12 

7 


smoker/nonsmoker prevalence split in the community 

16:47:16 

8 


is 50/50 throughout all ages. 

16:47:20 

9 


Can you assume that, just a hypothetical? 

16:47:22 

10 


You must work with them all the time? 

16:47:24 

11 

A. 

I avoid them like the plague. 

16:47:26 

12 

Q. 

If you will not accept that, tell me now because I 

16:47:30 

13 


don't want to do — will you accept that 

16:47:32 

14 


hypothetical? 

16:47:32 

15 

A. 

No. 

16:47:32 

16 

Q. 

Why? 

16:47:34 
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17 A. Because I don't have any evidence for which to base 

18 that it's true. I mean, you're asking me to 

19 speculate about something I don't know about and I 

20 feel uncomfortable doing that. 

21 MR. HAMLIN: Let him finish his answer. 

22 THE WITNESS: You're asking me to 

23 speculate about something I don't know one way or 

24 the other, and that makes me uncomfortable as a 

25 person trying to give you accurate answers. 


16:47:36 

16:47:38 

16:47:40 

16:47:42 

16:47:42 

16:47:44 

16:47:48 

16:47:50 

16:47:52 


1 BY MR. SILFEN: 

2 Q. No, actually, you know, that really isn't a fair 16:47:54 

3 answer. 16:47:54 

4 I'm asking you to accept a predicate so 16:47:58 

5 that we can do an analysis of data we do have. 16:48:02 

6 Because I want to understand how the data works. 16:48:04 

7 So I'm saying a predicate. I'm asking you 16:48:06 

8 to accept as a predicate a 50/50 split, and then I'm 16:48:12 

9 saying, all right, then how do we interpret it? 16:48:16 


10 

Now, you will 

not do 

that? 

16:48:16 

11 

MR. 

HAMLIN: 

Objection; foundation. 

16:48:16 

12 

MR. 

SILFEN: 

I understand. 


13 

MR. 

HAMLIN: 

There's no factual foundation 

16:48:20 

14 

for your predicate. 

I mean, if you were going to 

16:48:22 

15 

present him with some 

data — 

16:48:24 

16 

MR. 

SILFEN: 

I have presented his own 

16:48:26 

17 

results that 

say smokers — 

16:48:28 

18 

MR. 

HAMLIN: 

No, no, present him with data 

16:48:30 


about prevalence, Tom, then there might be some 


16:48:32 
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20 


basis for the hypothetical. Right now 

you're asking 

16:48:34 

21 


him to speculate. 


16:48:34 

22 

BY 

MR. SILFEN: 



23 

Q. 

On page 2 of your report, it says that 

the 

16:48:38 

24 


smoker/nonsmoker prevalence split for 

Minnesotans 

16:48:42 

25 


age 19 and over is over 50 percent. 


16:48:44 


237 


1 


All I'm asking you to do is assume that 

16:48:46 

2 


that continues through the years. 

16:48:48 

3 


MR. HAMLIN: That chart does not deal with 

16:48:50 

4 


nursing homes, and you know that. 

16:48:52 

5 


MR. SILFEN: But that's not the issue. 

16:48:54 

6 


It's not how many people are in nursing homes, it's 

16:48:58 

7 


what the split is in the community. It's where they 

16:49:00 

8 


draw from, Tom. 

16:49:02 

9 


MR. HAMLIN: You're asking him a 

16:49:04 

10 


hypothetical about nursing homes. 


11 


MR. SILFEN: But, Tom, you were wrong — 


12 

BY 

MR. SILFEN: 


13 

Q. 

Isn't it true that the critical denominator is 

16:49:06 

14 


smokers and nonsmokers? Let's settle this. The 

16:49:10 

15 


denominator that you would want to know, the 

16:49:12 

16 


prevalence, would be the prevalence of smokers and 

16:49:14 

17 


nonsmokers in the community, right? 

16:49:16 

18 

A. 

For what purpose? 

16:49:18 

19 

Q. 

To determine whether smokers and nonsmokers enter 

16:49:20 

20 


nursing homes at different rates? 

16:49:22 

21 

A. 

I would not ask the question to smokers and 

16:49:26 
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22 

23 

24 Q. 

25 A. 


nonsmokers entering nursing homes at different rates 
without stratifying on the key known factors. 

Okay. Let's do this — 

The most important of which is probably age. 


16:49:30 

16:49:36 

16:49:36 

16:49:38 


238 


1 

Q. 

You have said — tell me what prevalence data you 

16:49:48 

2 


need in order to complete this analysis. 

16:49:52 

3 


Your report says Minnesotans 19 and over 

16:49:54 

4 


and it gives a prevalence data. Tell me what age 

16:49:58 

5 


group. Do I need smoking prevalence for Minnesotans 

16:50:00 

6 


55 and over and then you'd be able to do it? 

16:50:04 

7 

A. 

To do what? 

16:50:04 

8 

Q. 

To tell me whether smokers or nonsmokers during the 

16:50:08 

9 


period of NHANES entered nursing homes at a greater 

16:50:12 

10 


rate. 

16:50:12 

11 

A. 

I'm saying that I wouldn't ask that question. 

16:50:16 

12 

Q. 

I'm asking you the question, and I want to know what 

16:50:18 

13 


additional data you need. 

16:50:20 

14 

A. 

And I don't know what additional data I need because 

16:50:24 

15 


you're asking me to do something here on the fly. 

16:50:26 

16 


And when you do things on the fly like that, you 

16:50:30 

17 


make mistakes, and I don't want to make a mistake 

16:50:32 

18 


that I'll regret. 

16:50:32 

19 

Q. 

Okay. I wouldn't want you to make a mistake that 

16:50:38 

20 


you would regret. 

16:50:38 

21 


MR. HAMLIN: Can we go off for just a 

16:50:42 

22 


minute? 


23 


MR. SILFEN: No. 


24 


MR. HAMLIN: We can't go off? 
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25 


MR. SILFEN: Do you have another luncheon 


16:50:42 


239 


1 

date? 





2 


MR. HAMLIN: No. 

Off the 

record. 

16:50:46 

3 


(Off the record.) 



16:51:08 

4 


THE VIDEOGRAPHER: 

: This concludes the 

16:51:10 

5 

testimony 

of Scott Zeger. 

The time 

is now 4:51 

16:51:14 

6 

p .m. 




16:51:14 


7 

8 (The deposition was adjourned for the day.) 16:51:14 

9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 
21 
22 

23 

24 

25 
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1 

STATE OF MINNESOTA) 

2 ) 

COUNTY OF HENNEPIN) 

3 

BE IT KNOWN THAT I, JENNIFER S. SATI, took the 

4 DEPOSITION OF SCOTT ZEGER, Ph.D., VOLUME I; 

5 THAT, I was then and there a Notary Public in 

and for the County of Hennepin, State of Minnesota; 


THAT, I exercised the power of that office in 
7 taking said deposition; 


8 THAT, by virtue thereof I was then and there 
authorized to administer an oath; 

9 

THAT, said witness, before testifying, was duly 

10 sworn to testify to the truth, the whole truth, and 
nothing but the truth, relative to this action; 

11 

THAT, said witness reserved the right to read 
12 and sign the deposition; 


13 

14 

15 

16 
17 


THAT said deposition is a true record of the 
testimony given by the witness; 

THAT, I am neither attorney nor counsel for, 
nor related to or employed by any of the parties to 
this action in which this deposition is taken and, 
further, that I am not a relative or employee of any 
attorney or counsel employed by the parties hereto, 
or financially interested in this action. 


18 


DATED THIS 13TH DAY OF SEPTEMBER, 1997. 


19 


20 


21 JENNIFER S. SATI, RPR, CRR 

Notary Public, Henn. County, Minn. 

22 My Comm. Expires January 31, 2000 

23 


24 


25 
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5 
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